Foreword


The preparation of this volume was supported in part by a grant from 
the Joseph P. Kennedy, Jr. Foundation. The idea of conducting a 
critical analysis of research on mental retardation in relation to theories 
of learning was proposed to me originally by Nicholas Hobbs on behalf 
of the Foundation. I was by no means entirely satisfied with my
qualifications for this assignment, having worked exclusively on one side
of the relationship. However, the task offered an interesting challenge
and it seemed to me that it would be worth trying to appraise what has 
been done and what might be done to exploit learning theory in this 
research area. In further discussions with Dr. Hobbs and communica- 
tions with Mr. Sargent Shriver, then Director of the Foundation, I agreed 
to undertake a critical analysis and to attempt to complete it within a 
year. The year in question, which was to have been 1966, stretched even- 
tually to three, primarily because none of us appreciated the extent to 
which the literature in this area is burgeoning. Further, it became in- 
creasingly evident to me as the work proceeded that research on mental 
retardation cannot meaningfully be separated from research on the 
development and organization of learning abilities. The title of this 
monograph reflects the enforced shift of emphasis. 

This volume is intended to make the results of the study available to 
investigators of mental development, to students preparing for research 
in this field, and to readers with general interest in theories of learning 
and their applications. 

I hope that the emphasis on criticism in this report does not outweigh
in the minds of readers the substantial appreciation due the
investigators whose imaginatively conceived and energetically prose- 
cuted research programs are producing substantial progress toward the 
building of a new discipline. I thought that it was my task, not to mete 
out praise or blame, but to concentrate on the possibilities of sharpen- 
ing and advancing theoretical analysis of the development and mal- 
development of learning abilities. It should be kept in mind that only 
work which has been considered to be of some merit is cited or dis- 
cussed in this report. A total of 1770 books and articles were examined 
closely enough to be entered in our card file. Those considered to add 
little information and those valuable for other purposes but not germane
to the present enterprise are not cited in this volume. 

The relatively extensive survey of the literature was possible only 
with the assistance of Katherine W. Estes, who also smoothed the final 
copy considerably with her editorial pencil. Mariana Fry Weidner also 
contributed to the survey in the earlier stages. The transcription of my 
notes and reflections from many yards of often all but indecipherable 
dictaphone tape was begun by Winifred Woltering and completed by 
Edith Skaar. Miss Skaar also deserves credit for careful typing of 
the final manuscript. 


Learning Theory
and 
Mental Development 


Introduction 


The word “and” in the title of this volume signifies a logical product, 
not a logical sum. Merely to survey contemporary learning theory re- 
quires a substantial book in itself (Hilgard & Bower, 1966). Individual 
presentations of the major contemporary learning theories could easily 
fill out a full shelf; and at least the remainder of a sizable bookcase
would be required to accommodate the accumulated empirical infor- 
mation and related theory concerning the development of intelligence. 
But the interaction of learning theory with the study of mental develop- 
ment has been so slight that an extensive search of the literature yields 
material for only a slender volume. 

This state of affairs is rather striking in view of the common tend- 
ency virtually to equate intelligence with learning ability. Note, how- 
ever, that this identification has never been documented by research. 
The principal trends in both the study of mental organization and the 
study of learning have been away from conceptions of either learning 
ability or intelligence as unitary entities. Further, Guilford (1967) notes 
that, not only have his extensive investigations yielded no factor inter- 
pretable as learning ability but, “In fact, one could look a long way in 
the literature and not find such a factor” [p. 268].

Yet it would be absurd to conclude that learning and intelligence are 
unrelated. One can measure an individual's intelligence at any given 
age only by sampling the skills and knowledge he has acquired. Differ- 
ences in performance between individuals following similar opportuni- 
ties to learn are, by definition, our only source of information regarding 
learning ability. They are also our principal basis for measuring
intelligence. In both cases, one recognizes that test performance must
depend on genetic and maturational factors as well as on experience, but
scarcely a start has been made toward separating these contributions. 

The reaction to this frustrating situation on the part of a few investi- 
gators, especially in the area of language development, has been to 
dally with the possibility that the development of language, and there- 
fore the enormous range of intellectual performance dependent on lan- 
guage, is independent, if not of learning in a broad sense, at least of 
the kinds of learning processes studied in the laboratory (Lenneberg,
1967).

This view would seem to be tenable only so long as an investigator 
limits himself to the study of the normal course of development. 
Within this limitation, purely developmental theories have their attrac- 
tions, perhaps the most outstanding example being Piaget's theory of 
intellectual development (1926, 1952, 1967). However, a major part of 
the research on the development of intellect is instigated by problems 
arising when development deviates from the “normal.” It is of no help 
in dealing with, for example, the mentally retarded to be provided only 
a characterization of these individuals in terms of the stage of mental 
development they have reached. It is essential for social as well as sci- 
entific reasons to exploit every possibility of preventing or remedying 
mental deficiency. Substantial progress in these respects can be hoped 
for only to the extent that individual differences in intellectual perform- 
ance can be traced to specific causes. Thus it is only natural that psy- 
chological research on mental retardation has concentrated more and 
more heavily upon the investigation of learning processes. 

The rather sudden acceleration of this type of research has been ini- 
tiated, not by broad strategic considerations, but by a combination of 
increasing pressure of practical problems and the appearance of major 
financial support by the National Institutes of Health, the Kennedy 
Foundation, and other agencies, following the appearance of the report 
of the President’s Panel on Mental Retardation (1962). Not surprisingly,
research has advanced on a jagged front, much being purely empirical
in conception, some directed by various extant theories. 

The large volume of research on learning currently reported in such 
publications as Advances in Child Development and Behavior (Lipsitt 
& Spiker, 1963, 1965, 1967) and the International Review of Research 
on Mental Retardation (Ellis, 1966a, b, 1968) is surely useful. But it is 
important to ask whether these increasingly substantial investments of 
research effort could be made still more effective. Given that support is 
substantial, the main possibility of increasing effectiveness must lie 
either in methodology or in the relation between research and theory.
The former has been receiving increasing attention during the past few
years, starting with reviews by Denny and others presented in Stevens 
and Heber (1964), and followed by various critical articles in the Ad- 
vances and the International Review. 

It is the main purpose of the present study to give similar critical 
attention to the relationship between learning theory and research 
concerned with mental development: both to normal development,
with special attention to individual differences in learning abilities, and 
to the types of deviations from the norm which define the area of 
mental retardation. 

In order to make clear the basis for my interpretation and judg- 
ments, I have first assembled a summary of the main concepts and 
principles of contemporary learning theory as I see them, at the same 
time trying to indicate what resources modern learning theory may 
have to offer to research on mental development. Then I have pro- 
ceeded to review principal lines of research on the development and 
maldevelopment of learning abilities, and to interpret the research 
methods and results in terms of this body of theory. 

The third main section of this report concerns the extent to which 
research on the development of learning abilities has been organized 
and directed by theory. In the course of attempting to deal with this 
question it was necessary to examine the particular theories which 
have been involved to appreciable extents and to seek to determine 
how useful they have been in practice. In concluding the final section, 
I have abstracted some implications of the study regarding research 
approaches that might more fully utilize available theoretical re- 
sources. 

Any analysis of the literature bearing on individual differences in 
learning ability must distinguish two main types of relationship to 
learning theory. The first has to do with the way in which individual 
differences are represented within any given learning theory, the sec- 
ond, with the theoretical interpretation of individual differences in var- 
ious types of intellectual performance. Literature pertaining to the 
former aspect has been reviewed by Glaser (1967). This literature is far 
from extensive, and as a consequence learning theorists have fre- 
quently been sharply criticized for giving little explicit attention to the 
representation of individual differences within models for learning 
(e.g., Hirsch, 1962). 

It is certainly true that learning theorists are most often concerned 
with relationships which are believed to be common to all members of 
a species and that in formalizing their theories they nearly always 
present theoretical functions which are assumed to hold for homoge- 
neous populations of organisms. However, it is easy to misunderstand 
the sense in which homogeneity is assumed. Most often homogeneity is 
merely a simplifying assumption, introduced in order to permit the 
beginning of mathematical analysis of what, without rather severe
simplification, would be an unmanageable tangle of relationships (see, e.g.,
Estes, 1964, pp. 91-93). It is frequently, if not always, clearly
recognized by learning theorists that this phase must be followed later by
the relaxation of simplifying assumptions and examination of the
consequences for theoretical predictions which arise from individual
differences in learning parameters (cf., e.g., Trabasso & Bower, 1968;
Zeaman & Kaufman, 1955).

To my knowledge the first interpretation of mental development in
terms of learning theory was undertaken by Thorndike (1931). Thorndike
proposed to account for the progressive increase in intellectual
capacity from the lower organisms to man in terms of the quantitative
increase in the number of available associative connections. Presumably
individual differences in learning and intellectual ability among
human beings could also be attributed to variation in number of
interneuronal connections. According to Thorndike the quantitative
increase in the number of available neural connections permits the
appearance of ideas, insight, and reasoning. But effective intelligence is a
function of the number of associations actually formed. Since one
would expect a direct relationship between the amount of knowledge
an individual has acquired and the number of associative connections
formed in his nervous system, Thorndike believed his analysis to be
supported by the well-known positive correlation between measures of
information or vocabulary and other measures of mental ability.

Over the 30-40 years since Thorndike’s pioneering efforts, there has
been considerable change in the frame of reference within which
learning theorists approach problems of individual differences, but
very little advance in the analysis of specific problems. In his general
comments on a symposium concerned with individual differences in
learning, Melton reported (1967) as the consensus of the symposium
contributions that research on individual differences in learning must
be guided by theories of human learning and performance, and that
progress in this area must depend upon the framing of hypotheses
regarding individual difference variables “in terms of the process
constructs of contemporary theories of learning and performance [p.
239].”¹ Analysis of the extent to which this prescription is actually
being effectuated in contemporary research, or could be in the near
future, is a principal concern of the present volume.


¹A similar view has been expressed by Scott (in press), with
documentation in terms of a careful review of relevant literature.


Part A 


Theoretical Background


I. Modern Learning Theory 


Borrowing an expression from Miller (1959), we might characterize 
the viewpoint of the present review as that of a liberalized association 
theory. The history of learning theory, in this country at least, has been 
one of continuing elaboration of association theory, with major contri- 
butions coming during each succeeding generation from more transito- 
rily influential developments, for example, gestalt psychology, informa- 
tion theory, and, most recently, computer analogies. Although 
association theory has its limitations, and other views their points of 
special appeal, it is within the associationistic framework that the pre- 
ponderance of research has been conducted leading to detailed inter- 
pretations of numerous varieties of learning. 

It should be recognized that the conception of the organism in 
modern association theory is considerably richer than a bundle of re- 
flexes or stimulus-response bonds. The specific version which, I be- 
lieve, now marshals broadest support is a stimulus-response formula- 
tion at the level of performance, but not at the level of learning, 
conceived as an inference from performance. In order to be rigorously 
testable and to be practically useful in the guidance and management 
of training and education, learning theory must provide detailed anal- 
yses and predictions of the changes in an individual’s dispositions to 
respond to stimuli or classes of stimuli that develop as a function of 
experience. Thus, the formulation I shall outline begins and ends the
treatment of each type of learning situation with analyses of perform- 
ance in stimulus-response terms, following the general approach of 
Guthrie (1935), Hull (1943), and Skinner (1938). However, the associa- 
tive process is conceived to be, not a matter of switching connections 
of a response from one stimulus to another, but rather the learning of 
relations between events. The connection between learning and
performance involves feedback relationships in which the process of
response selection is continually modified by anticipations of rewarding
and punishing consequences.

It will become clear, I trust, in the following presentation that only a
drastic misconception of modern association theory could put it in
opposition to views which emphasize the importance of structure and
organization in behavior as, for example, the “structure-of-intellect”
theory of Guilford (1967). On the contrary, the basic properties of the
associative process lead to the formation of progressively higher-order
perceptual and motor units. The result of an extended period of
learning is not simply a proliferation of associative relations between
elementary stimuli and motor reactions, but rather a hierarchical
organization of associative relations between stimulus and response
patterns of varying levels of complexity.

In this section I shall sketch the set of concepts and assumptions
drawn from contemporary theories of learning which I shall use as a
basis for interpreting research on learning in relation to mental
development. While I assuredly would not claim a consensus among
learning theorists on the framework presented, I believe the principal
assumptions have broad empirical support and that, subject to
differences in emphasis and particular terminology, they constitute the
framework within which much current research on learning is being
conducted. For purposes of exposition I shall first list the principal
assumptions with only brief elucidation, then give illustrative
interpretations of a few standard types of learning experiments, and finally
consider applications to problems of mental retardation and
acceleration.

1. the elements of association 


I take the term learning to refer always to a change of state of the
organism inferred from performance, [...] relatively enduring change in
behavior [...] result of previous experience. At a conceptual level,
learning is represented by the formation of associations between
representations of behavioral events which have occurred in temporal
contiguity. By behavioral event [...]action to any stimulus input, [...]
from my earlier theoretical [...] developed within the stimulus-[...]
of those earlier formulations in terms of a process more closely [...]
conditioning (Spence, 1951). Examples are conditioned anticipation of
aversive events and of reward (Estes & Skinner, 1941; Estes, 1948),
latent learning (MacCorquodale & Meehl, 1954), incidental learning
(Postman, 1964), and conditioning under curare (Solomon & Turner, 1962).

The associations between representations of behavioral events are 
assumed to be characterized by multiplicity and random fluctuations in 
state of activity in accord with the mechanisms developed within ear- 
lier expositions of stimulus sampling theory (see Estes, 1959; Neimark 
& Estes, 1967). Thus, of the total number of elementary associative 
connections which might be established between representations of 
two behavioral events, only a subset may be established on any single 
learning trial, and of these only a random sample will be in an active
state on any single later trial.
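
The sampling assumption just stated can be illustrated with a small simulation. The Python sketch below is only an illustration under stated assumptions, not a statement of stimulus sampling theory itself: the function names, the independence of elements, and the parameters p_establish and p_active are all hypothetical choices made for the example.

```python
import random

def simulate_trials(n_elements, p_establish, n_trials, rng):
    """Return the set of elementary connections established after n_trials.

    Assumption: on each learning trial, every connection not yet
    established is established independently with probability p_establish.
    """
    established = set()
    for _ in range(n_trials):
        for element in range(n_elements):
            if element not in established and rng.random() < p_establish:
                established.add(element)
    return established

def active_sample(established, p_active, rng):
    """Random subset of the established connections active on a later trial.

    Assumption: each established connection is active independently with
    probability p_active.
    """
    return {e for e in established if rng.random() < p_active}

rng = random.Random(0)  # fixed seed so repeated runs give the same numbers
for trials in (1, 3, 10):
    est = simulate_trials(100, 0.2, trials, rng)
    act = active_sample(est, 0.5, rng)
    # trials, connections established so far, connections active at test
    print(trials, len(est), len(act))
```

Under these assumptions the established set can only grow with repetition, while the active set on any test is a random subset of it, which is the qualitative picture the paragraph describes.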


2. conditions of association 


The contiguous occurrence of two behavioral events in temporal
succession is assumed to be the necessary and sufficient condition for
establishing temporary associations, or, in other terms, for entering the
relationship between these events in short-term memory. Following a
contiguous occurrence, the associations revert to their initial state at a
random rate over time, generating a short-term retention curve. During
the short-term retention interval, which is normally a matter of
seconds, some of the associations may become consolidated instead of
reverting to the initial state. In this case the associations will be
reactivated on a later occurrence of one of the associated events and we say
that there has been a transfer of information to long-term memory
storage. If this transfer does not occur, then the associations will not
be reactivated by later occurrence of either event and we say that the
information has been lost from memory.
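
The shape of the short-term retention curve implied by these assumptions can be sketched numerically. The following Python fragment is a minimal illustration, assuming a discrete-time model that is not part of the text: in each time step a temporary association reverts with probability revert_rate, is consolidated with probability consolidate_rate, and otherwise remains temporary. Both parameter names and the values used are hypothetical.

```python
def retention_curve(revert_rate, consolidate_rate, n_steps):
    """Expected proportion of associations still retrievable at each step.

    Assumed per-step model: a temporary association reverts to its initial
    state with probability revert_rate, is consolidated with probability
    consolidate_rate, and otherwise stays temporary. Consolidated
    associations never revert.
    """
    if revert_rate < 0 or consolidate_rate < 0:
        raise ValueError("rates must be non-negative")
    if revert_rate + consolidate_rate > 1:
        raise ValueError("rates must sum to at most 1")
    temporary, consolidated = 1.0, 0.0
    curve = [temporary + consolidated]
    for _ in range(n_steps):
        newly_consolidated = temporary * consolidate_rate
        temporary *= 1.0 - revert_rate - consolidate_rate
        consolidated += newly_consolidated
        curve.append(temporary + consolidated)
    return curve

curve = retention_curve(revert_rate=0.3, consolidate_rate=0.1, n_steps=20)
print([round(v, 3) for v in curve[:5]], round(curve[-1], 3))
```

In this sketch the curve falls from 1 toward consolidate_rate / (revert_rate + consolidate_rate), so the asymptote stands for the information transferred to long-term storage and the remainder for the information lost.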


3. conditions of transfer to long-term memory


Perhaps the most important condition for consolidation is an interval
free of interference from the occurrence of similar events (that is,
events which would involve associative elements overlapping with
those just established) or unusual disturbances of the nervous system
such as those produced by electroconvulsive shock. Further, transfer
to long-term memory is facilitated by overt or covert repetition of the
associated behavioral events, that is, the type of activity termed
rehearsal. Finally, there is considerable evidence to indicate that the
activity which facilitates transition to long-term memory is initiated only
if the second of two associated events resolves some uncertainty on the
part of the learner (see, e.g., Egger & Miller, 1962; Trabasso & Bower,
1968). Suppose, for example, that a stimulus compound consisting of
components A and B is presented to an organism, followed by event C.
If as a result of previous learning, A has been established as a reliable
predictor of C, then associations between B and C will not become
consolidated in long-term memory.
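
The example of the compound AB followed by C can be written out as a short rule. The Python sketch below is a deliberately all-or-none simplification, assumed only for illustration: the dictionary long_term, the function name, and the rule that any already-established predictor completely prevents new consolidation are hypothetical and not taken from the text.

```python
def consolidate_compound(long_term, compound, outcome):
    """Store component -> outcome associations only if the outcome is
    not already predicted by some component of the compound.

    long_term maps each stimulus component to the set of events it
    reliably predicts. If a component already predicts the outcome, the
    outcome resolves no uncertainty and nothing new is consolidated.
    """
    already_predicted = any(
        outcome in long_term.get(component, set()) for component in compound
    )
    if not already_predicted:
        for component in compound:
            long_term.setdefault(component, set()).add(outcome)
    return long_term

memory = {}
consolidate_compound(memory, ["A"], "C")       # A alone is followed by C
consolidate_compound(memory, ["A", "B"], "C")  # compound AB is followed by C
print(memory)

control = {}
consolidate_compound(control, ["A", "B"], "C")  # no prior learning about A
print(control)
```

With prior training on A, component B acquires no stored association with C; without it, both components do, which is the contrast the paragraph draws.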


4. retrievability from long-term memory 


Given that associations have been formed and consolidated between
two behavioral events, A and B, which have occurred contiguously, a
later occurrence of the sequence AB gives rise to the reaction we term
recognition. Whether a later occurrence of A alone will activate a
representation of B in memory depends upon a number of conditions,
including number of previous repetitions, duration of the retention
interval, and activities on the part of the learner which modify the form of
memory storage. The principal role of the retention interval in relation
to retention loss may be to permit the formation of interfering
associations between A and other events. The principal function of repetition
is to permit an increase in the number of associations as a result of the
random-sampling process. The ability of an individual upon the later
occurrence of A to reinstate the overt response components of event B
may, at least in the case of human beings, be greatly facilitated by a
type of activity termed recoding (Bower, 1967; Greeno, 1969;
Lawrence, 1963). Of special importance is the conversion of visual to
auditory representations by means of verbal rehearsal. For most
individuals, at least, the visual components of memory of a previous experience
contain relatively little information. Consequently a great gain in
accuracy of recall is possible if, during the short-term memory interval,
various properties or aspects of what has been seen are verbalized and
converted to auditory storage.


5. motivation and response evocation 


It is assumed that, except perhaps for the simplest reflexes, response
evocation depends upon the joint action of stimulus input from
receptors and input from drive mechanisms. The latter presumably
involve the midbrain centers which are known to exert rewarding or
punishing effects upon direct electrical stimulation (see, e.g., Miller,
1963; Olds, 1961), but we shall not attempt any detailed physiological
interpretation here. Originally, these specific mechanisms are
activated by unconditioned stimuli, for example, the mechanism
associated with hunger is activated by the taste of food, the mechanism
associated with pain by impinging traumatic stimulation. The result of
activity of a drive mechanism is to generate what may be termed
facilitatory or inhibitory feedback. For example, the taste of food initiates
facilitatory feedback which then summates with the inputs of stimuli
associated with ingestive behaviors and leads to the continuation of
consummatory activity. If, while the animal is eating, it receives a
painful shock, the result is initiation of inhibitory feedback from the
negative drive system. An important aspect of this feedback is that it
reciprocally inhibits the activity of the currently active positive drive
system, thus cutting off facilitatory feedback and leading to the
cessation of consummatory behavior. At the same time, the feedback from
the negative drive center summates with unconditioned stimuli for
such reactions as attack or flight.

By associative learning, proceeding in accord with the assumptions
of Section 1 above, control of these positive and negative drive
mechanisms is extended to stimuli which have preceded the original
unconditioned stimuli. If the sight of food has preceded the taste on an earlier
occasion, then the sight of food will lead to reinstatement in memory of
a representation of the unconditioned stimulus, which will in turn
initiate facilitatory feedback, thus increasing the likelihood that the
response sequence leading from the sight of food to ingestion will again
run off without interruption. On the other hand, upon later occurrence,
a stimulus which has preceded electric shock will activate a central
representation of the shock experience, thus leading to negative drive
activity and consequently to reciprocal inhibition of positive drive
systems which have been supporting ongoing positively motivated
behavior. This suppression of ongoing behaviors by a stimulus which has
preceded a traumatic event is the much investigated conditioned
emotional response (CER) first analyzed by Estes and Skinner (1941).

The result of this combination of motivational and associational
mechanisms is that after some learning experience the organism’s
behavior is continually modulated by anticipations of rewards or
punishments, behavior sequences leading to increases in positive feedback
being preferentially selected and behavior sequences leading to
decreases in positive feedback or to punishment being inhibited. A
consequence of these mechanisms which is of special importance for
discrimination learning is that in any situation involving differential
contingencies between some stimuli or stimulus aspects and reward,
and between other stimuli or stimulus aspects and nonreward or
punishment, the organism will learn to approach the former and to avoid
the latter.


6. conditions of retention and transfer 


The three principal conditions of retention loss that we can identify
are (a) spontaneous, random fluctuations in the state of activity of
elementary associations, (b) changes in stimulus context, and (c)
interfering learning.

The first of these refers to the assumptions (see Section 1 above)
that the associations established upon contiguous occurrence of any
two behavioral events revert to their initial state at a random rate over
time. So far as is known this short-term [...] autonomous process that
can be modified only by varying the number of associations established
upon a given [...]. If, of three individuals exposed visually to the
same paired-associate item, one simply looks at each member, one also
verbalizes each member aloud, and one in addition conjures up a
visual [...] in some context, we should expect the [...] the largest
number of temporary associations and the first individual the least;
thus they would rank in this order with respect to recognition or recall
after a short retention interval.

The second factor refers to the fact that situations are never
reproduced identically and that, other things [...] occurred in succession,
the probability [...] first on a later occasion will lead to recall [...]
be directly related to the degree of similarity [...]

[...] with the fact that if the stimulus S1 has [...] stimulus S2 on one
occasion and with a [...] occasion, then upon later [...] to whether S2
or S2' will be [...] constructive intellectual activity on the part of the
learner which leads to selective storage of associations which avoid the
interference.

For example, suppose the stimulus combination AB has been fol- 
lowed by C on one occasion and the combination AD has been followed 
by E on another. Confusion in recall will be avoided if the learner is led 
to attend selectively to the unique components, the B and D, and to 
store in long-term memory the associations BC and DE, thus avoiding 
errors that might arise from associations of component A with either C 
or E. The learner can achieve this result if, for example, upon presen- 
tation of the combination AD-E he recognizes the ambiguous associa- 
tions of component A with C and E and thus rehearses only the asso- 
ciation D-E following the second presentation. 
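
The selective-rehearsal example can be made concrete with a few lines of code. The Python sketch below is an illustration only: the representation of an episode as a pair (components, outcome), the function name, and the rule of discarding every component that has been followed by more than one outcome are assumptions introduced here, not the author's formulation.

```python
def unambiguous_associations(episodes):
    """Return the component -> outcome associations worth rehearsing.

    episodes is a list of (components, outcome) pairs, for example
    (("A", "B"), "C") for the combination AB followed by C. A component
    followed by more than one distinct outcome is treated as ambiguous
    and is not stored.
    """
    outcomes_by_component = {}
    for components, outcome in episodes:
        for component in components:
            outcomes_by_component.setdefault(component, set()).add(outcome)
    return {
        component: next(iter(outcomes))
        for component, outcomes in outcomes_by_component.items()
        if len(outcomes) == 1
    }

episodes = [(("A", "B"), "C"), (("A", "D"), "E")]
stored = unambiguous_associations(episodes)
print(stored)
```

For the two episodes in the text, only the associations from B to C and from D to E are retained, so a later presentation of either unique component recalls a single outcome.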

The basic condition for transfer of learning from one situation to 
another is assumed to be communality in constituent associations. If 
events A and B have occurred in succession and have become asso- 
ciated in memory, then a representation of B will tend to be reinstated 
in memory on later occasions by other events which are similar to or 
overlap with A in proportion to the degree of similarity or overlap. It is 
assumed that once associations have been formed and consolidated 
this transfer occurs in a largely automatic manner. However, various 
activities on the part of the learner during a learning experience can 
influence transfer to an important extent. On the one hand, as illustrated 
in the preceding paragraph, undesired transfer can be reduced if the 
learner is led to attend selectively to stimuli which are unique to a given 
situation and to rehearse only associations involving these unique 
aspects. On the other hand, transfer can be enhanced if the learner
attends selectively to stimuli or stimulus properties which are common
to the original and the transfer situations and selectively rehearses
these.
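
The assumption that transfer is proportional to overlap can be given a simple numerical form. In the Python sketch below, events are represented as sets of elements and the measure of overlap (the proportion of the trained event's elements that recur in the test event) is a hypothetical choice made for illustration; the text does not commit to any particular measure.

```python
def transfer_strength(trained, test):
    """Proportion of the trained event's elements present in the test event.

    Assumed index of the tendency of the test event to reinstate whatever
    was associated with the trained event: 1.0 for identical element sets,
    0.0 for events with no elements in common.
    """
    trained, test = set(trained), set(test)
    if not trained:
        raise ValueError("trained event must contain at least one element")
    return len(trained & test) / len(trained)

print(transfer_strength("abcd", "abcd"))
print(transfer_strength("abcd", "cdef"))
print(transfer_strength("abcd", "wxyz"))
```

On this index, selective attention to common elements amounts to enlarging the shared portion of the two sets, and attention to unique elements amounts to shrinking it.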


7. probability learning


Although within a more formal statement of the theory sketched
here, probability learning can be derived from other, more elementary
mechanisms, we shall simply state it as an independent assumption
that the organism stores information with respect to relative
frequencies of sequences of events. If, for example, event A has been followed
by B and C on different occasions in the past, then the organism’s
tendency to anticipate B following the occurrence of A is directly
related to the relative frequency with which B has actually followed A.
Probability learning provides the basis for the continual [...]
behavior by variations in anticipation of reward and [...]
exposed to an environment containing stimuli which have had different
past histories of association with punishment and reward, the organism
receives positive feedback on the average whenever it shifts attention
from a stimulus which has had a lower, to one which has had a higher
relative frequency of association with reward and, conversely, receives
inhibitory feedback whenever it shifts attention from a stimulus more
highly correlated with reward to one less highly correlated with reward
in the past.
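
The probability-learning assumption lends itself to a brief worked example. The Python sketch below is a minimal illustration, assuming that the stored information is a simple count of which event followed which; the class name SequenceMemory and its methods are hypothetical and introduced only for the example.

```python
from collections import Counter, defaultdict

class SequenceMemory:
    """Hypothetical store of how often each event has followed another."""

    def __init__(self):
        self._followers = defaultdict(Counter)

    def observe(self, first, second):
        """Record one occasion on which `second` followed `first`."""
        self._followers[first][second] += 1

    def anticipation(self, first, second):
        """Relative frequency with which `second` has followed `first`."""
        counts = self._followers.get(first)
        if not counts:
            return 0.0  # nothing has yet been observed to follow `first`
        return counts[second] / sum(counts.values())

memory = SequenceMemory()
for follower in ["B", "B", "C", "B"]:
    memory.observe("A", follower)
print(memory.anticipation("A", "B"), memory.anticipation("A", "C"))
```

Here B has followed A on three of four occasions, so the tendency to anticipate B after A is three times the tendency to anticipate C.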


8. formation of hierarchies of stimulus and response units 


A major consequence of the associative process is that the learning
organism continually builds higher-order units, generating the
phenomena termed stimulus patterning and response integration. If a
number of originally unrelated stimuli repeatedly occur together,
multiple associations form among the organism’s reactions to the various
components, so that ultimately the occurrence of any component
reinstates a representation of the whole set in memory; once this
learning has occurred, the set of stimuli enters into association with
other behavioral events as a unit. One describes the end result of this
learning by saying that the organism has come to respond to the
collection of stimuli as a pattern.

Similarly, members of a set of motor reactions which are repeatedly
evoked in close contiguity come to be associated so that evocation of a
component reinstates a representation of the entire set, which then
may enter into association with other behavioral events as a unit. The
former aspect of unit formation is most conspicuous in learning to
read, when letters, syllables, and words come to act in certain respects
as unitary patterns. The latter aspect is similarly conspicuous in
learning to speak and in the development of verbal control of behavior
as, not only words, but propositions expressing rules and strategies
come to act as units in the continuing elaboration of the associative
learning process.


Organization of still higher-order stimulus-response units occurs in
discriminative and conceptual learning, and in free recall.
Discrimination problems are frequently generated by combining stimulus
dimensions, as defined by the experimenter, in various ways and arranging
contingencies of rewarding and nonrewarding outcomes upon response
to particular dimensions or combinations of dimensions. Once the
organism has learned to associate a common perceptual or mediating
response with the various values on a stimulus dimension, for example
colors or forms, it is in a position to learn to approach or avoid entire
dimensions in new problems, thus greatly short-cutting the process of
learning to approach or avoid the various constituent stimuli. Similarly,
sets of responses, such as names of colors, names of states, or the like,
can be associated with common mediating, or categorizing, responses;
then in new situations the individual’s capacity for recall can be
greatly amplified. Suppose, for example, that the list of words read to a
subject in a free-recall experiment included the first ten digits, the
primary colors, and the names of the New England states. A sufficiently
sophisticated learner would need only to rehearse and transfer
to long-term memory the category names, or code designations, of the
three classes in order to be prepared to reproduce the entire list on
request.
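
The free-recall example can be put in the form of a short sketch. It is
only an illustration under stated assumptions: the text names the three
classes but not their members, so the contents of `CATEGORIES` (digit names
zero through nine; red, yellow, and blue as the primary colors) are
hypothetical, as are the function names `encode` and `recall`.

```python
# Hypothetical category contents; the text names the classes, not the members.
CATEGORIES = {
    "digits": ["zero", "one", "two", "three", "four",
               "five", "six", "seven", "eight", "nine"],
    "primary colors": ["red", "yellow", "blue"],
    "New England states": ["Maine", "New Hampshire", "Vermont",
                           "Massachusetts", "Rhode Island", "Connecticut"],
}


def encode(words, categories):
    """Replace each fully presented category by its name; keep other words."""
    remaining = set(words)
    codes = []
    for name, members in categories.items():
        if set(members) <= remaining:
            codes.append(name)
            remaining -= set(members)
    return codes, sorted(remaining)


def recall(codes, leftovers, categories):
    """Expand the rehearsed category names back into the individual words."""
    recalled = []
    for name in codes:
        recalled.extend(categories[name])
    return recalled + list(leftovers)


# The presented list: every member of the three classes, in scrambled order.
presented = [w for members in CATEGORIES.values() for w in members][::-1]
codes, leftovers = encode(presented, CATEGORIES)
reproduced = recall(codes, leftovers, CATEGORIES)
print(len(presented), "words presented;", len(codes), "codes rehearsed")
```

The point of the sketch is only the ratio: the whole list is recoverable
from three rehearsed codes, provided the category memberships are already
in long-term memory.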


9. learning to learn 

Beyond the broader developmental changes in learning abilities
treated in the following section, more localized changes in learning
rates are generally observed as a function of experience with particular
types of tasks or materials. Within the theory outlined here, these
latter, more specific, types of improvement in learning rate are derivable
from other aspects of the theory and do not require independent
assumptions. Nonetheless, because of the special importance of
“learning to learn,” or the development of learning sets, for
interpretation of mental development and retardation, it will be useful
to summarize the principal concepts involved.

Many cases of apparent learning to learn represent, not changes in
learning rates, but simply transfer from learning which has occurred in
situations involving common associative relationships. An elementary
example is the progressive increase in rate of acquisition during a
sequence of acquisition and extinction sessions in classical or operant
conditioning (for a review of the literature, see Capaldi, 1967). As a
result of learning in earlier sessions, the unconditioned or reinforcing
stimulus itself becomes a signal for further reinforcement; since this
component of the stimulus complex which normally precedes
reinforcement during acquisition is necessarily absent on extinction
trials, and thus is protected from extinction, its effectiveness is
immediately reinstated upon the first trial of reconditioning. This type
of increase in learning rate by virtue of specific transfer is especially
pervasive in the learning of language. Associations formed to the roots
of words, for example, and to various prefixes and suffixes contribute to
appropriate performance upon the occurrence of new words involving
common elements.

Whereas the former examples involve specific transfer of associative
relationships between the response being learned and elements of the
transfer situation, progressive increases in learning rate in
discrimination and conceptual learning situations are probably more often
mediated by observing or attentional responses to cues or stimulus
dimensions which are common to broad classes of problems. Over a series of
form-color discrimination problems, for example, the individual learns
that attention to the relevant dimension is associated with reward
whereas attention to irrelevant dimensions, such as size, is not, so that
in later problems of the series attention is confined to relevant stimulus
dimensions from the beginning and “unnecessary” errors are
eliminated. Similarly, in learning to read the child learns to attend to
the forms of letters rather than to size of print, or the like.

Even when successive discrimination problems are unrelated with
regard to stimulus relationships, the initially naive organism will
exhibit a progressive increase in acquisition rate simply as a
consequence of learning that premature choice responses are associated
with higher probabilities of nonreward than delayed choice responses,
thus coming to inhibit overt response on any trial until all of the stimuli
in the display have been scanned.

In verbal learning and recall, in addition to all of these more
elementary mechanisms, improvements in learning or retention of various
degrees of specificity may result from the acquisition of rehearsal and
organizing strategies. With regard to the former, over a series of
similar tasks the individual may learn which particular stimulus
properties or relationships need to be selectively rehearsed in order to
lead to increased probabilities of reward. With regard to the latter,
various types of categorizing and grouping strategies, usually discovered
by trial and error, may be rewarded and then facilitate recall
performance over a series of related tasks.

It will be observed that, according to the present analysis, the term
“learning to learn” is in a sense a misnomer. An individual who has
[...] of new associations; rather, he needs [...] attain a given level
of performance.


II. Interpretations of Standard Learning Situations 


In this section we review several standard learning experiments,
identifying basic processes within the framework of the theory 
sketched above and attempting to identify important sources of indi- 
vidual differences in learning ability. 


1. classical conditioning


Pavlovian conditioning is perhaps the simplest of the standard learning
experiments in that the course of conditioning may be relatively free
of specific dependence upon previous learning. Nonetheless, even in
this apparently highly simplified experimental paradigm, there is room
for multiple processes. And complexities multiply rapidly with even
seemingly innocuous modifications of classical experimental
arrangements.

Let us consider, as a specific example, conditioning of the eye blink
with an electric shock to the cheek as the unconditioned stimulus (US).
Conceptually, this process is somewhat simpler than the more
frequently studied variant with an air puff to the cornea as the US or than
salivary conditioning with acid as the US. The simplification arises
from the fact that the occurrence of the conditioned response (CR) has,
so far as is known, no appreciable effect upon the individual’s reaction
to the US. Prior to conditioning, the shock elicits a defense reaction,
including the eye blink and the activation of a negative drive amplifier
which provides facilitative input to the family of defense reactions. The
conditioned stimulus (CS), usually a flash of light or the sound of a
buzzer, normally evokes an original reaction of which an eyelid
response is only a minor component. Over a series of presentations of
the CS followed by the US, associations form between representations
of these stimuli, but owing to random fluctuations both in the
environment and in the state of the subject only a sample of the
population of available associations is formed and consolidated on each
trial.

The first association formed is sufficient to lead, upon occurrence of
the CS, to anticipation of gross aspects of the unconditioned reaction
(UR), including in particular the negative drive input. Frequently after
only one conditioning trial, the individual will yield evidence of
anticipating the US upon presentation of the CS. However, a considerable
number of trials may be required before enough associations are
formed and consolidated so that a CS presentation brings into
immediate memory a sufficiently detailed representation of the US to lead
to an anticipatory occurrence of the particular response pattern, in this
instance the eye blink, which meets the criterion for a CR.

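
The contrast between early anticipation and late appearance of the CR can
be sketched numerically. The sketch rests on assumptions that are not in
the text: each available association is taken to be formed and consolidated
independently with a fixed probability `theta` on every trial, and the CR
is taken to appear once the expected fraction of consolidated associations
reaches a `criterion`. Both parameters and both function names are
hypothetical.

```python
def expected_fraction(n_trials, theta):
    """Expected fraction of the available associations formed after n trials,
    assuming each is sampled independently with probability theta per trial."""
    return 1.0 - (1.0 - theta) ** n_trials


def trials_to_criterion(theta, criterion):
    """Smallest number of trials at which the expected fraction of formed
    associations reaches the criterion assumed to be needed for a CR."""
    if not 0.0 < theta <= 1.0 or not 0.0 <= criterion < 1.0:
        raise ValueError("need 0 < theta <= 1 and 0 <= criterion < 1")
    n = 0
    while expected_fraction(n, theta) < criterion:
        n += 1
    return n


# Illustrative values only: a fifth of the associations sampled per trial,
# and four fifths required before the full response pattern is reinstated.
theta, criterion = 0.2, 0.8
print("fraction after one trial:", expected_fraction(1, theta))
print("trials to CR criterion:", trials_to_criterion(theta, criterion))
```

Under these illustrative values some associations exist after a single
trial, which would suffice for anticipation of the US, while several more
trials are needed before the assumed criterion for the CR is met.
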
A principal reason why individuals of very different learning ability
generally exhibit little if any difference in rate of simple CR
acquisition may be the fact that in standard experimental procedures many
sources of individual differences in conditioning rate are minimized.
Individual differences in intensity or extent of response to the US are
generally compensated experimentally by adjusting the intensity of the
US for each subject so as to yield URs of approximately the same
magnitude. Similarly, individual differences in the tendency to observe
or attend to the CS are minimized in standard experimental
procedures by measures which are taken to insure adequate stimulation of
the subject by the CS on each trial. One might suppose that there
would be major differences in the number of associations formed per
trial when other factors are held constant; however, neither this
supposition nor the related expectation that learning ability should be
intimately tied up with rate of formation of associations has found
substantial experimental support. Measures which reduce individual
differences in reactions to the CS and US have been found to reduce
individual differences in observed rates of acquisition, sometimes
almost to the vanishing point (see, e.g., Voeks, 1954).

Why should individual differences in rate of extinction of a CR be
more systematically related to other measures of learning ability than
individual differences in acquisition? The answer may lie in the fact
that the extinction procedure is less well controlled. In part, extinction
involves a new conditioning process in which the individual learns to
anticipate absence of the US following CS presentation. This event,
however, is less well defined than US occurrence, and there is no way
(comparable to adjusting intensity of a US) by which the experimenter
can compensate for individual differences in responsiveness to the
changed state of affairs following the CS. At the time of occurrence of
a CS on a late acquisition trial, the individual may be influenced by the
stimulus traces from the URs of the previous trials or by
representations of these in immediate memory. However, immediately
following the first extinction trial this source of contextual stimulation
is no longer available. The change in context could lead to a decrease in
the individual's tendency to anticipate the usual consequence (that is, the
occurrence of the US) even prior to any new learning
(counterconditioning).

The importance of these changes in context depends to a major
extent on the individual's attentiveness to background changes. To the
extent that the influences of changes in context are mediated by
short-term memory processes, and especially by verbal processes, they
would tend to be related to mental ability measured by tests involving
these processes.

Some implications of the present interpretation are readily testable.
For example, although rate of initial acquisition of CRs has been found
to vary little over a wide range of IQ levels, we predict that if the same
individuals were given a series of repeated acquisitions and extinctions
of the same CR in a given situation, the progressive increases in
acquisition rate routinely observed under these conditions would be
greater for individuals at higher IQ levels. Therefore a relationship
between rate of conditioning and mental ability would appear with
experience even under conditions such that none would be manifest during
initial acquisition.

This prediction is based on the idea that the same types of change in
background, or stimulus context, which contribute to rapid extinction
following initial acquisition would operate in reverse, so to speak, to
facilitate reacquisition. The individual who is sensitive to the more
remote stimulus relationships over sequences of trials may learn that
the occurrence of a US on any trial of a series greatly increases the
probability that it will occur on the next; thus, with experience over a
sequence of acquisition and extinction blocks, such an individual
might come to exhibit relatively high rates of both conditioning and
extinction.

Whereas the number of temporary associations established on a
conditioning trial evidently involves sufficient multiplicity and
redundancy so that individual differences in this respect are not a major
source of variance in conditioning rates, one might suspect that an
important locus of individual differences would lie in the “triggering”
of the process of consolidation or transfer to long-term memory
storage, or in the susceptibility of this process to disturbance by events
occurring during a short interval following a conditioning trial.
However, although conditions influencing the course of consolidation have
begun to receive rather intensive study (see, e.g., Grossman, 1967, pp.
797-814), there seems to be no information available concerning
individual differences in this process or relationships to standard
measures of mental ability.

It should be noted especially that, since the eye blink is under
voluntary control in normal individuals, and doubtless in all but the
most severely retarded, it is only under especially simplified conditions
that one can expect to study conditioning uncomplicated by more
complex mental processes. Adequate simplification is probably
approximated only when a US, such as shock to the cheek, is used which is
not modified appreciably by occurrence of the CR and when during
the experiment the subject is engaged in some task which occupies his
attention and minimizes the likelihood that verbal expectations
concerning occurrence of the US will modify his response to the CS.

When a US such as an air puff to the cornea is used, the situation
necessarily becomes more complex, since occurrence of the eye blink
during an appropriate interval following onset of the CS reduces the
unpleasant stimulation from the air puff, thus establishing avoidance
contingencies. In this case the individual is in a position to learn a
simple discrimination, namely that occurrence of the CS plus the eye
blink leads to less unpleasant stimulation of the cornea than
occurrence of the CS and omission of the eye blink.

Acquisition of this discrimination is of course facilitated by verbal
rehearsal of the relevant stimulus relationships, and thus it is not
surprising that, in the standard eyelid conditioning situation with air
puff as the US, rate of conditioning is considerably higher for
student subjects than for retarded groups (Ross, 1966). It remains to be
determined whether the conditioning rates for the retarded could be
brought up to the normal level in this situation by appropriate
instruction and training with respect to attention to the stimulus
contingencies and rehearsal of stimulus relationships.


2. operant conditioning 


Superficially, operant conditioning might seem simpler than classical
conditioning, since it does not involve the transfer of a response from
one stimulus to another but only the strengthening of a response which
the individual already has some tendency to make in the given
situation. Within the present theoretical framework, however, it is
recognized that response tendencies are not simply strengthened in a direct
way by rewarding consequences. Rather, a rewarded occurrence
provides an opportunity for the individual to learn associations between
stimuli encountered in the course of executing the given response and
the rewarding consequence.

In an operant situation the learner is not constrained to attend to
particular stimuli at particular times as in the classical conditioning
experiment, and at the beginning of training he first receives reward
when he happens to run through the prescribed stimulus-response
sequence in the course of more-or-less random activity. As a result of
this experience, learned associations may be formed between the
successive behavioral events in the sequence, the likelihood of this
occurring being increased to the extent that the individual attends to the
relationships between successive stimuli and rehearses these following
the receipt of reward. Once the associations have formed, the
individual, upon later encountering the first stimulus in the sequence, will
be led to anticipate reward, thus generating facilitative feedback and
increasing the likelihood that the stimulus-response sequence leading
to reward will run to completion.

Although there are doubtless individual differences in rates of
formation of associations and transfer from short- to long-term memory
storage, it seems likely that, to even a greater extent than in the classical
conditioning situation, individual differences in rate of operant
conditioning are associated largely with differences in tendencies to attend
to relevant stimuli or aspects of the stimulating situation.

In the present theoretical framework it is assumed that the
mechanism producing facilitative feedback is activated by increases and
inhibited by decreases in anticipation of reward. In order for an
individual’s behavior to be brought under the control of feedback from
anticipation of reward he must learn that changing a stimulus situation
in one direction (for example, by approaching a lever) increases the
probability of reward and changing it in the other direction (by moving
away from the lever) reduces probability of reward. This kind of
control is not obtained if, for example, the individual makes the response
which produces reward while attending to an irrelevant aspect of the
stimulus situation.

In the training methods used by students of operant conditioning,
much more care is generally taken in arranging contingencies between
responses and rewards than in insuring that the learner will have an
opportunity to observe relationships between the stimulus changes and
probability or immediacy of reward.

The importance of stimulus variables is seen especially clearly in the
studies of operant conditioning of verbal behavior (reviewed by
Krasner, 1958). In free-responding situations, the subject produces a
continuous stream of speech, and utterances belonging to some particular
class receive reward. Opportunities for the subject to perceive
relationships between stimulus changes and reward probabilities are slight,
and characteristically, rate of conditioning is very low.

By contrast, when the situation is modified so that each successive 
trial begins with the display of a set of stimuli, response to one of 
which eventuates in reward, conditions are relatively favorable for the 
perception of stimulus-reward relationships, and rate of conditioning is 
relatively high (Taffel, 1...). Thus over the wide range of operant
conditioning situations in which stimulus variables are relatively
uncontrolled, we should expect wide individual differences in observed rates
of conditioning to result from individual differences in tendencies to
utilize observing or attentional responses with respect to stimulus
relationships.

Even more than in the case of the simple operant responses usually
studied in the conditioning laboratory, acquisition of the finely
differentiated response patterns involved in perceptual-motor skills depends
to a major extent upon the learning of relationships between stimulus
changes and changes in probability of reward. To take a simple
example, if a child is being trained to touch with a pointer a spot of light
which may appear anywhere upon a screen, learning, if manifest at all,
will be painfully slow if a separate stimulus-response association has
to be formed for each possible location of the spot. Acquisition of the
skill depends upon learning that reduction in the distance between the
pointer tip and the light spot, regardless of the location of the latter,
signifies an increase in probability (or reduction in delay) of reward and
that, contrariwise, increase in this distance signifies a decrease in
probability of reward. When this relationship has been learned, the
child’s movement of the pointer is continuously modulated by the
visual feedback from the changing distance between the pointer and the
spot.
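
The pointer example lends itself to a small sketch of control by a single
learned relationship. The sketch assumes, for illustration only, a screen
laid out as an integer grid and a learner who at each moment takes whichever
unit movement most reduces the pointer-to-spot distance; the function names
`distance` and `track` are hypothetical.

```python
import math

# The four unit movements available to the pointer on the assumed grid.
STEPS = [(1, 0), (-1, 0), (0, 1), (0, -1)]


def distance(a, b):
    """Straight-line distance between two screen positions."""
    return math.hypot(a[0] - b[0], a[1] - b[1])


def track(pointer, spot, max_moves=1000):
    """Move the pointer step by step, always choosing the movement that most
    reduces the distance to the spot (the one relationship assumed learned)."""
    moves = 0
    while pointer != spot and moves < max_moves:
        candidates = [(pointer[0] + dx, pointer[1] + dy) for dx, dy in STEPS]
        best = min(candidates, key=lambda p: distance(p, spot))
        if distance(best, spot) >= distance(pointer, spot):
            break  # no available movement brings the pointer closer
        pointer = best
        moves += 1
    return pointer, moves


# The same rule serves for any location of the spot.
for spot in [(7, -3), (-4, 9), (12, 12)]:
    final, moves = track((0, 0), spot)
    print(spot, "reached" if final == spot else "missed", "in", moves, "moves")
```

Nothing in `track` refers to particular locations, which is the contrast
drawn in the text with forming a separate stimulus-response association for
every position the spot might occupy.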


We must expect major individual differences in speed of acquisition 
of perceptual-motor skills to be related to differences in attentiveness 
to relevant stimulus dimensions and, similarly, major differences in 
efficacy of different types of training procedures to be related to the 
extent to which they successfully direct the learner's attention to the 
relevant stimulus relationships.


3. discrimination learning

In modern learning theories discrimination learning is generally
assumed to involve the same basic processes as simple classical or
instrumental conditioning. Particular theories differ largely with respect
to assumptions as to how more elementary processes are combined or
organized in discriminative situations. Within the theoretical
framework developed in the previous chapter, a discrimination problem can
be analyzed in terms of a set of possible stimulus-response sequences,
each beginning with attention to some aspect of the stimulus situation
present on the trial and eventuating in a rewarding or punishing
outcome.

The learning which goes on in the course of mastering a
discrimination problem is a matter of learning the relationships between the
successive behavioral events in these stimulus-response sequences. The
process involves the establishment of temporary associations, simply
as a result of contiguous occurrences of events, followed by the
consolidation, or transfer to long-term memory, of subsets of these
associations as in simpler conditioning situations. Owing to the structure of
the discrimination problem, stimulus elements which initiate some
sequences will, in the course of learning, come to serve as signals for
anticipation of reward whereas others will become signals for
nonreward.

On the average, discriminative performance is expected to improve
as learning proceeds, but the relationship is by no means one to one.
At the beginning of learning, performance with respect to the stimuli to
be discriminated is generally random. At the end of acquisition,
attention to various aspects of the stimulus situation leads to different
degrees of anticipation of reward and consequently different degrees of
facilitative feedback to behavior sequences which have followed these
in the past. As a consequence, stimulus-response sequences having the
greatest likelihoods of leading to reward tend to be selected in the
presence of each of the stimulus situations involved in the problem.

Improvement in correct responding may be expected to occur
primarily at points in the learning sequence where the individual shifts
attention from cues which are unrelated to cues which are more
closely related to reward and nonreward, that is, from irrelevant to
relevant components or properties. During sequences of trials when the
learner is attending only to irrelevant cues and basing his responses
upon these, there will be no observable change in probability of correct
responding. Therefore, as emphasized by Zeaman and House (1963) in
their analysis of discrimination learning, major sources of individual
differences in rate of discrimination learning must be differences in
initial tendencies to sample, or attend to, different aspects of the stimulus
situations involved, and also individual differences in sensitivity to the
different degrees of anticipation of reward.

Doubtless, individual differences in both of these respects are
influenced to an important extent by learning which occurs in many
different situations in the course of normal development or in the course
of specific training in a particular type of situation. The former aspect
has received considerably more attention in the literature (see,
especially, Zeaman & House, 1963) than the latter. Nonetheless, it seems
clear in terms of the present analysis that an individual's level of
anticipation of reward upon entering a new discrimination problem should
be an important factor in rate of learning. If, for example, an individual
is accustomed to failure in any given type of discriminative situation,
then upon facing a new problem his a priori anticipation will be only
50% reward. Stimulus-response sequences which eventuated in 50%
reward (that is, those arising from attention to irrelevant aspects of the
situation) yield the rate of reward he has come to expect and
consequently there is no reason for him to deviate from his habitual mode of
response. However, if the individual is accustomed to success, then
upon confronting a new problem the 50% frequency of reward
associated with response to irrelevant features of the situation represents a
distinct reduction from his a priori expectations; thus ongoing
behaviors will tend to be inhibited by reduced facilitative feedback,
permitting him to sample other stimulus-response sequences which may lead
to reward with higher probability.
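
The comparison of the two learners can be reduced to a few lines. The
sketch assumes, as a simplification not stated in the text, that feedback is
simply the obtained rate of reward minus the a priori expected rate, and
that the habitual mode of response persists whenever this difference is not
negative. The 0.9 expectation for the learner accustomed to success is an
illustrative value, and the function names are hypothetical.

```python
def feedback(obtained_rate, expected_rate):
    """Assumed feedback signal: positive is facilitative, negative inhibitory."""
    return obtained_rate - expected_rate


def stays_with_habit(obtained_rate, expected_rate):
    """The habitual mode of response persists unless feedback is inhibitory."""
    return feedback(obtained_rate, expected_rate) >= 0


CHANCE = 0.5  # rate of reward when responding to irrelevant aspects

# Illustrative a priori expectations for the two kinds of learner.
learners = {"accustomed to failure": 0.5, "accustomed to success": 0.9}
for label, expected in learners.items():
    verdict = "keeps habit" if stays_with_habit(CHANCE, expected) else "samples anew"
    print(label, "->", verdict)
```

Only the learner whose expectation exceeds the chance rate receives the
inhibitory signal that frees him to sample other stimulus-response
sequences.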


The importance in discrimination learning of various general habits
which carry over from one problem to another is so great that the
differences in rate of discrimination usually observed between
experimentally naive individuals of different ages or IQ levels, or even
members of different species, can often be reversed by appropriate
training. Thus even the rat learns with experience in a two-choice
discrimination situation to slow down at the choice [...] stimulus
display fully before making its overt choice

[...] empirical grounds for the supposition that short-term memory enters
into discrimination learning in a more complex way than into simple
conditioning. Although, on the whole, discrimination learning, like
most other forms of learning, tends to be facilitated by distributed
practice, a number of studies provide rather clear indications that the
relationship is reversed at long intervals, with an optimum intertrial
interval in the neighborhood of 3 or 4 min for visual discrimination
learning in the rat (Pennington & Thompson, 1958; Weinstock, 1958).
A theoretical interpretation of the optimum interval can be offered in
terms of two processes that would be assumed to be operative within
the present theoretical framework. First, if successive trials are too
close together, the temporary associations established on one trial will
have little or no opportunity for consolidation, or transfer to long-term
memory, before onset of the next trial sets up interfering associations;
this process in itself would yield a simple, monotone function, relating
speed of discrimination learning directly to intertrial interval.

In another respect, however, relatively short intertrial intervals 
would be advantageous for discrimination learning. If the intertrial in- 
terval is such that the event sequence of one trial is still fully repre- 
sented in short-term memory when the next trial occurs, the learner 
will be in a position to notice relationships between changes in the 
stimulus and changes in the rewarding outcome which otherwise would 
be likely to escape attention. Thus, for example, Weinstock (1958) 
found that rats showed no progress whatever toward learning a dis- 
crimination between a flickering and a constant light over a large 
number of trials when the intertrial interval was 24 hr but that they
learned the same discrimination rapidly when the interval was reduced
to 5 min.


My interpretation is that the animal has very low probability of
attending to either condition of illumination unless trials are close
enough together so that it notices the change from one to the other.
Presumably, other things equal, the probability of attending to changes
in the stimulus situation would be a monotone decreasing function of
intertrial interval. The combination of the two gradients would yield a
nonmonotone function for rate of discrimination as a function of
intertrial interval, with an optimum at some intermediate value, in apparent
accord with the combined results of a variety of studies in this area.

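
How two monotone gradients can combine into a function with an interior
optimum may be seen in a brief sketch. The functional forms are
assumptions made only for illustration: consolidation is taken to rise
exponentially toward 1 with the intertrial interval, attention to stimulus
change to decay exponentially, and the two to combine as a product. The
time constants `tau_c` and `tau_a` are hypothetical and are not estimates
from any of the studies cited.

```python
import math


def consolidation(t, tau_c=2.0):
    """Assumed increasing gradient: chance that one trial's associations are
    consolidated before the next trial begins (t in minutes)."""
    return 1.0 - math.exp(-t / tau_c)


def attention(t, tau_a=10.0):
    """Assumed decreasing gradient: chance of noticing the change in the
    stimulus from one trial to the next."""
    return math.exp(-t / tau_a)


def learning_rate(t):
    """Assumed combination rule: the product of the two gradients."""
    return consolidation(t) * attention(t)


# Search a grid of intertrial intervals (in minutes) for the best one.
intervals = [0.25 * k for k in range(1, 400)]
best = max(intervals, key=learning_rate)
print("best interval on the grid:", best, "min")
```

For these particular forms the optimum can also be written in closed form
as tau_c * ln(1 + tau_a / tau_c); any pair of gradients of this general
kind yields a rate that is low at very short and at very long intervals and
highest somewhere between.
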
The importance of the second of these two processes may be much
greater in human beings than in animals, since by means of verbal
rehearsal the human learner can greatly extend the effective
short-term memory function. We should expect both that individual
differences in human discrimination learning would be related to rehearsal
tendencies and also that deficiencies in discrimination learning could be
substantially modified by special training which leads to the
establishment of effective rehearsal strategies.


4. learning set 


The term “learning set” was introduced by Harlow (1949) to refer
to a specific form of learning to learn observed in visual-discrimination
studies with monkeys. In the standard experiment the animals were
given series of similar but formally unrelated problems (unrelated in
the sense that solution of any one problem gave no information
concerning the solution of others in the series). Progressive improvement
was observed in the rate of learning these discriminations; this rate of
improvement defines the concept of learning set. Analogous
experiments were conducted later in paired-associate learning situations
with human subjects by Greenberg and Underwood (1950) and by Meyer
and Miles (1953).

What is the theoretical basis for the progressive improvement in
discrimination learning ability exhibited in these learning set studies?
Any type of theory can accommodate the phenomenon of progressive
interproblem improvement, but only certain types of theories provide a
positive basis for expecting it. The classical association or conditioning
theories of, for example, Hull and Spence, or basic stimulus sampling
theory can interpret the phenomenon by assuming changes in
parameter values.

But what processes should be expected to lead to progressive
changes in parameter values as a function of experience? One
possibility lies in such general strategies as slowing down at the choice
point before selecting a response (Estes & Lauer, 1957). Within the
type of discrimination learning theory developed by Zeaman and
House (1963), there is a more constructive basis for interproblem
improvement. The same process of learning attentional responses to
relevant dimensions which is assumed to be involved in the learning of
any one discrimination and which accounts for the advantages of
intradimensional shifts would imply progressive improvement in
discrimination learning rates over a series of experiences with problems
involving the same dimensions.

House, Smith, and Zeaman (1964) have pointed up the paradox
which is to be found in the results of studies of learning set, on the one
hand, and of proactive inhibition, on the other. According to the
former, an individual comes to learn more and more rapidly as a
function of experience with a particular type of material, but according to
the latter, he comes to forget more and more rapidly what he has
learned as a function of the same kind of experience. Both processes
were brought out nicely in the study of paired-associate learning
reported by House, Smith, and Zeaman. The subjects were mentally
retarded children with mental ages (MAs) of 4-8 years who learned a
succession of paired-associate lists with retention tests averaging about
40 hr after learning. The geometric mean number of errors to criterion
decreased from 4.0 on the first list to 3.5 on the fourth, 3.0 on the
seventh, and 2.1 on the tenth. Concurrently, percentage recall decreased
from 86 following the first list to 56 following the fourth, 53 following
the seventh, and 43 following the tenth.

An analysis of errors on both learning and recall tests was
instructive, showing that during original learning of each list, a very
large proportion of errors were intralist intrusions, whereas on recall
tests a very large proportion of errors were interlist intrusions. Thus,
during learning errors were primarily the result of failures to
differentiate among the items within a list and the learning-to-learn
effect represented a facilitation of this process. Conversely, errors of
recall were primarily a matter of confusing items between different
lists, and the frequency of such errors increased with the number of
lists that had been learned. It should be noted that there is a basic
incompatibility between the most efficient method of coding the items in
a particular list in order to facilitate learning and the most effective
method for maintaining recall of a large collection of lists.

These results and related ones suggest that the disappointingly poor
long-term retention frequently found for school learning may be a
result of teaching methods which emphasize efficient learning of small
blocks of material within particular class sessions at the cost of
measures which are needed to maintain long-term recall. Some of the
measures needing investigation with a view to improving long-term recall
are modification of coding strategies to reduce interlist interference,
techniques for promoting organization of recall, and the use of periodic
tests and rehearsals.


5. paired-associate learning

In terms of the experimental arrangements, a paired-associate task
simply involves the learning of a number of concurrent discriminations.
In practice, the materials used in paired-associate learning
experiments are largely verbal and this factor greatly increases the scope
for transfer and learning-to-learn effects. If a single paired-associate
item, say the relation between an English word and the corresponding
word in a foreign language, were considered entirely in isolation, the
processes involved in acquisition would presumably be essentially the
same as those in classical conditioning. On any one training trial,
representations of the stimulus and response members of the item and
associations between them would be established in the temporary
memory system and over a subsequent interval of time, in the absence
of interference, transfer of a portion of this information to long-term
memory would occur. With new samples of associations being
established on successive trials, the result of a sufficiently long series
would be that the presentation of one member of a pair would evoke in
memory a relatively full representation of the other.

However, it is likely that, in adult human beings, paired associates
are rarely learned in this fashion. Frequently the process can be
greatly short-cut in that the learner may find some feature or
component of the stimulus member of a pair which, through some chain of
existing associations, brings to mind features of the response member.
Indeed, in many cases the chain of associations is found to include the
name of the response member, in which case learning may be complete
for practical purposes in a single trial.

Thus individual differences in the observed rate of learning of single
paired-associate items must be expected to depend to a major extent
upon the amount of previous learning of similar material and upon
previously established habits of looking for mediators. However, neither
in laboratory experiments nor in most practical situations, such as the
acquisition of languages, are single paired-associate items learned in
isolation. Generally the task involves the learning of a list, in which
case the amount of information needed to provide a basis for uniform
correct response depends upon the number of items in the list and the
amount of communality between items.


In general only some subset of the components or aspects of each
stimulus member of the list need be represented in memory in order to
differentiate all of the stimuli from each other. Consequently, for any
given list, there is some minimal set of stimulus elements or aspects
which must be represented in memory in order to permit 100% recall.
The number of associations which must be established during the
learning of the list thus depends to an important extent upon the
learner's ability to discern or attend to features which differentiate among
the items of the set. By selectively attending to and rehearsing relation- 
ships involving these distinguishing features, an individual can greatly 
reduce the amount of learning that needs to be done before he can 
meet any prescribed criterion of correct performance. The bright col- 
lege student will typically master in a very few trials a paired-associate 
list which might be completely beyond the capacities of a retarded 
child, not because the college student's rate of formation of
associations is very different but primarily because he needs to learn so much
less in order to achieve correct performance on the same set of mate- 
rial. When the paired-associate situation is especially simplified so as 
to reduce the possible contribution of these various auxiliary pro- 
cesses, differences in acquisition rate between individuals of different 
IQ levels should diminish greatly. There seems little room for doubt 
that the inefficient performance of the mentally retarded individual in 
verbal paired-associate learning reflects to a major extent his
retardation in the development of various habits of selective
attention, search, and rehearsal. Little seems to be known, however,
concerning the extent to which his retarded development is in turn a
function of impoverished opportunities to learn relevant habits in
earlier life or the extent to which the retardation might be overcome
by special training.
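
The notion of a minimal set of stimulus elements sufficient to
differentiate all items of a list can be made concrete with a small
sketch. The representation of each stimulus as a dictionary of named
features, the function name, and the example list are all hypothetical
and serve only to illustrate the idea; nothing here is drawn from the
experiments reviewed.

```python
from itertools import combinations


def minimal_distinguishing_features(stimuli):
    """Return a smallest set of feature names that tells all stimuli apart.

    `stimuli` maps an item label to a dict of feature -> value. The search
    is brute force over feature subsets, smallest first, which is adequate
    for short illustrative lists. Returns None if two items are identical
    on every feature.
    """
    features = sorted({f for item in stimuli.values() for f in item})
    for size in range(len(features) + 1):
        for subset in combinations(features, size):
            codes = [tuple(item.get(f) for f in subset)
                     for item in stimuli.values()]
            if len(set(codes)) == len(codes):
                return set(subset)
    return None


# Hypothetical four-item list: size is shared by all items and so carries
# no information for telling them apart.
EXAMPLE_LIST = {
    "item1": {"shape": "circle", "color": "red", "size": "large"},
    "item2": {"shape": "square", "color": "red", "size": "large"},
    "item3": {"shape": "circle", "color": "blue", "size": "large"},
    "item4": {"shape": "square", "color": "blue", "size": "large"},
}

if __name__ == "__main__":
    print(minimal_distinguishing_features(EXAMPLE_LIST))
```

In this example a learner who attends only to shape and color has two
features per item to associate with each response, whereas a learner who
samples features at random must also store the uninformative size
feature before performance becomes reliable.
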


III. Individual Differences and Developmental Processes 


1. the concept of learning ability 


How are learning abilities to be conceptualized within the frame of
reference provided by modern learning theory? First of all, it is clear
that the notion of learning ability must be abstracted from performance
in particular learning situations. Substantial individual differences with 
respect to any measure of speed of learning are universally observed 
when a group of individuals, however similar on any selection criteria 
the experimenter may use, are run through an identical experimental 
routine. And this is true whether the situation be as simple as classical 
conditioning or as complex as classroom learning. If the individuals are 
well matched with regard to whatever previous experience is relevant 
to the particular task, then these differences in observed rates of 
learning are attributed to differences in learning ability. 

There is no reason, however, to assume anything like a one-to-one
relationship between abilities and types of learning tasks. Rather,
there are substantial grounds for assuming that differences in observed
learning rates in any given situation are a joint resultant of individual
differences in elementary capacities and in cumulative results of past 
learning in the same and other situations. The balance of contributions 
from these sources must be expected to vary from one type of learning 
situation to another. 

By “capacity” I refer to any source of individual differences,
determined either genetically or by some combination of genetic factors
and developmental processes which are independent of previous learning.
It is difficult to be more specific, for to date little progress has been 
made toward identifying particular learning capacities. Their existence 
is inferred indirectly from patterns of differences in observed perform- 
ance and from analogies to better-understood classes of behavioral 
phenomena, for example, those having to do with sensory and motor 
processes. 


Although it is not yet possible to point to any relatively elementary 
learning capacities, comparable say to those having to do with visual or 
auditory acuity in the sensory sphere, there are indications that some
identifications may be realizable within the foreseeable future. For
example, in the study of phenomena of memory among individuals with 
neurological disorders, some relatively clear cases have been reported 
of individuals who retain normal performance on immediate-memory 
tests but exhibit complete loss of the ability to consolidate material 
which has been entered in immediate memory (Milner, 1966; Wickel- 
gren, 1968). But, if transfer of information from short-term to long-term 
memory depends upon a relatively specific neurophysiological mecha- 
nism which can be selectively eliminated by neurological damage, then 
it seems most likely that there will be found to be individual
differences in the efficiency of operation of this mechanism and that
these will in turn be found to make some contribution to efficiency of
performance in tests of memory and in learning situations.

Primarily on the basis of analogy to better-understood systems, it
seems likely that the maturation of the anatomical and physiological
substrate of elementary learning capacities must be essentially complete
at a very early age in human beings, quite possibly within the first year
of life, and that individual differences in rates of maturation can have
little to do with individual differences in the development of learning
ability from infancy to adulthood. Differences in capacities may well be
involved in mental retardation; but if so, it would seem that the critical
differences must lie primarily in limits rather than rates of
development. Thus “mental deficiency” may be a more appropriate term than
“mental retardation.”

There are abundant reasons to believe that even if the isolation of
elementary learning capacities and their genetic determiners can be
achieved, we will come into possession of no magic key to the
understanding of the normal development of learning abilities or the
modification of deficiencies. Except perhaps in infancy, a great part of the
variance in rates of learning between individuals must be attributable
to differences in the results of past learning. Observed rates of learning
in most situations, we now realize, depend to a major extent upon
habits or strategies of selective attending, seeking of information,
coding and recoding of stimulus information, rehearsing, and the like,
and the manner in which these are organized. Individual differences in
these habit systems may indirectly reflect differences in capacities, but
they must also be strongly determined by variation in motivational
systems and in previous opportunities to learn. Thus, Gagné (1968) speaks
of the “cumulative-learning” hypothesis of intellectual development.
New learning at any age depends to a major extent on the recombining
of previously acquired discriminations, behavior sequences, principles,
and concepts.


2. development of learning abilities 


The differences between the infant and the adult human being with 
respect to the results of experience, that is, the accumulation of
knowledge and skills, are, at the same time, so vast as to defy description and
so obvious as to be taken for granted. But only the deepened under- 
standing provided by decades of research has begun to make it clear 
that differences in learning abilities between infant and adult are of at 
least the same order of magnitude and are at present far beyond our 
ability adequately to conceptualize. Nonetheless, once our observation 
of learning moves from the uncontrolled situations of everyday life into 
the laboratory, we cannot help but be almost equally impressed by the 
similarities between infant and adult learning. 

Under experimental conditions which minimize the role of volun- 
tarily controlled behavior, the acquisition of conditioned responses 
proceeds at very similar rates in animals, human children, and human 
adults. Similarly the rate of paired-associate acquisition for simple, 
nonverbal stimuli and responses varies little with age over most of the 
life span and differs little between mentally retarded and normal 
groups. And, retention of simple associations once established, if not 
actually independent of age and other indices of mental development, 
at least exhibits no systematic variation that has been detectable by 
research to date. 

To my knowledge the only serious effort toward a theoretical inter- 
pretation of this paradox is that presented by Hebb (1949). I shall as- 
sume a familiarity on the part of the reader with Hebb’s analysis 
though in the present review there will be no occasion to go into 
Hebb’s neurophysiological model. To almost the same extent as was 
the case when Hebb was developing his pioneering formulation some 
20 years ago, an approach to developmental phenomena in terms of 
learning theory today must be largely speculative. The enormous litera- 
ture on changes in sensory, motor, and intellectual performance with 
age is still accompanied by only the scantiest beginnings of a parallel 
accretion of factual information concerning developmental changes in 
detailed aspects of learning. Useful descriptions of the stages of intel- 
lectual development characteristic of children within certain cultures 
have been provided, notably by Piaget (Piaget, 1926; Woodward, 1962),
but there remains the task of interpreting the sequences of stages in
terms of learning processes. 


Associative Processes. With regard to the most elementary aspects
of the associative process, the preponderance of evidence indicates
that the rate of establishment of temporary associations between
contiguous occurrences of behavioral events varies extremely little
over the lifetime of the individual and, further, is extremely resistant
even to brain damage. In the senescent normal individual and in the
mentally retarded (see, e.g., Ellis, 1968), very-short-term memory of the
type presumed to be mediated largely by these temporary associations
appears relatively unaffected.

Quite the contrary is the case with regard to consolidation of these
associations, or the transfer of the information contained into long-term
memory, a process which occurs at rates that change with age in a
curvilinear fashion. The rate of this transition process presumably
increases during early development, though explicit documentation is
difficult to find, and certainly decreases with the onset of senility. Also
it is extremely sensitive to disturbances of the nervous system and to
many types of brain damage. A notable example of the dissociation of
the processes of establishment and consolidation of temporary
associations is observed in the neurological disorder termed Korsakoff's
syndrome.

One source of this disparity may lie in the vast multiplicity of
elementary associations set up by any one learning experience. The
degree of redundancy is probably so great that substantial variation in
numbers of temporary associations established might have no
detectable effect on short-term memory. On the other hand, there is
considerable reason to believe that the process of consolidation, or transition
to long-term storage, though doubtless modifiable by many conditions,
may be initiated following any given learning experience in an
all-or-none fashion by some mechanism, as yet unidentified, which is
relatively localized. Lines of relevant evidence range from the
indication that consolidation of conditioned responses in goldfish is initiated
only upon removal from the apparatus (Davis, Bright, & Agranoff,
1965) to data on so-called stimulus selection in paired-associate
learning (Trabasso & Bower, 1968), and to demonstrations of
“blocking” of conditioning by redundant stimuli (Kamin, 1967).

The changes in the neurophysiological mechanism responsible for
initiating consolidation which occur in the course of maturation and
aging may well be largely independent of external control. At the same
time, there is no doubt that operation of this mechanism may also be 
influenced to an important extent by habits of rehearsal and tactics for 
protecting newly formed associations from interference. Most attention 
tends to be given in the literature to the long-term possibility that func- 
tioning of consolidation or storage mechanisms might be improved by 
drugs or other physiological manipulations. But the more immediate 
fruits of research in this area, which might yield tangible improve- 
ments in learning efficiency on the part of normal individuals as well as 
the retarded, would seem to be the habits of selective attention and 
rehearsal and other behavioral tactics that can be developed by training 
and brought under voluntary control. 

The other side of the short-term retention process perhaps merits 
more attention than it has had in the literature. Efficient performance
in many situations may depend as much upon efficient short-term for- 
getting as upon long-term retention. Consider, for example, the plight 
of the operator of a calculating machine if he were unable to clear from 
memory the input from each successive display on the keyboard. In 
this type of situation, retention may be prevented to a considerable 
extent by the individual's switching attention rapidly from one display 
to the next, thus minimizing the interval free of interference which is 
evidently needed for the initiation of the consolidation process with 
respect to any one input. 

Although there seem to have been no systematic empirical studies of 
developmental changes in the efficiency of “short-term forgetting,”
indirect evidence from work of Spitz and his associates (1959, 1969) 
suggests that there may be significant differences between retarded 
and normal individuals in the speed with which shifts of attention 
occur. Unfortunately, at present there seems to be no basis whatever 
for judging to what extent the observed differences reflect retarded 
development of some as yet unidentified neurophysiological processes 
or indirect effects of other developmental processes which may de- 
pend, at least in part, on previous experience. 


Formation of Higher-Order Behavioral Units. The major 
source of the difference in rates of conditioning or associative learning 
generally observed between immature and mature organisms is simply 
that the former have more to learn in order to meet the criteria for 
learning in any given situation. Very early in development, if each of 
two events which occur in contiguity comprises a number of aspects,
associations among all of the components must be formed and
consolidated before the learning process can be considered complete.
Suppose, for example, an event with components A, B, and C were
followed by an event with components D, E, and F. Temporary
associations would be set up between components of the first event
and components of the second. However, in the absence of rehearsal,
only some random sample of the associations formed on the given
learning experience would subsequently be consolidated. Since the
elementary associations fluctuate between active and inactive states
over time, upon the next occurrence of the first event some of the
associations formed on the first occasion might be inactive. Owing to these
two factors, repeated trials would be required before enough
associations could be formed to provide a basis for reliable recall of the
second event upon occurrence of the first. If, however, in the course of
the organism's development these events were to recur many times,
whether together or separately, associations would form among the
components of each, so that ultimately the occurrence of any
component of either event would tend to reinstate in memory a
representation of the whole of that event. If the two events first occurred in
contiguity after each had taken on such a unitary character, one might
observe “one-trial learning.” Associations between a constituent of the
first event and any of the constituents of the second would be
sufficient, upon the recurrence of the former, to reinstate in memory a
representation of the latter.
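
The contrast drawn above between slow learning of unintegrated events
and rapid learning of unitary ones can be illustrated by a small
simulation. This is only a sketch under stated assumptions: the
probabilities p_consolidate and p_active, the recall criteria, and all
function names are hypothetical choices made for illustration and are
not estimates from any study.

```python
import random

FIRST = ("A", "B", "C")   # components of the first event
SECOND = ("D", "E", "F")  # components of the second event


def trials_to_recall(unitized, rng, p_consolidate=0.3, p_active=0.7,
                     max_trials=1000):
    """Count pairings needed before the first event recalls the second.

    Assumed criteria: for unintegrated events every component of the
    second event must be reached by an active association; for unitized
    events one active association between any pair of components suffices.
    """
    pairs = [(s, r) for s in FIRST for r in SECOND]
    stored = set()
    for trial in range(1, max_trials + 1):
        # Only a random sample of the temporary associations formed on
        # this trial is consolidated.
        for pair in pairs:
            if pair not in stored and rng.random() < p_consolidate:
                stored.add(pair)
        # At the next occurrence of the first event each consolidated
        # association is independently active or inactive.
        active = {pair for pair in stored if rng.random() < p_active}
        if unitized:
            recalled = len(active) > 0
        else:
            recalled = {r for (_, r) in active} == set(SECOND)
        if recalled:
            return trial
    return max_trials


def mean_trials(unitized, runs=2000, seed=0):
    """Average trials to recall over independent simulated learners."""
    rng = random.Random(seed)
    return sum(trials_to_recall(unitized, rng) for _ in range(runs)) / runs


if __name__ == "__main__":
    print("unintegrated events:", mean_trials(False))
    print("unitized events:   ", mean_trials(True))
```

Under these assumptions the unitized case usually meets the criterion on
the first pairing, while the unintegrated case requires several, which
is the qualitative pattern the verbal account leads one to expect.
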

Hebb (1949) first assembled the empirical evidence and offered a
neurophysiological interpretation of the important developmental
period of “slow learning.” During this period recurring perceptual and
motor patterns become integrated through multiple associative
learning into higher units, thus preparing the way for the facile,
all-or-none learning characteristic of the mature individual. Analyses of
factors involved in the development of “reading readiness” have
demonstrated progressive improvement over the ages of four to eight years in
ability to recognize transformations of critical features of stimuli, for
example, letter-like forms, which appear in printed material (Gibson, 1965).
Impoverishment of a child’s early environment, including especially
any major restrictions on play activity or lack of feedback from older
individuals with respect to vocalizations, must be expected to retard
severely this important process of development of behavioral units and
thus in turn to restrict the range of situations in which the individual
could be expected to manifest normal learning rates in later life.




Further, substantial retardation of this developmental phase of slow
learning, even if caused entirely by environmental circumstances, is
likely to prove almost irreversible. Firstly, in the absence of adequate
stimulation and activity, neurophysiological mechanisms involved in
normal adult learning may fail to develop (Haywood & Tapp, 1966;
Hebb, 1949). Secondly, and perhaps more importantly, conditions in
impoverished environments, such as custodial institutions, generally
do not provide variety and duration of exposures to diverse elementary
perceptual-motor experiences comparable to those available to the
normal child during the first few years of life. Efforts, however
persistent, to train or educate the older retarded individual by means of the
teaching and training techniques appropriate to a normal individual of 
similar chronological age (CA) must be expected to prove largely inef- 
fectual, since they do not compensate for the retarded individual's 
impoverished repertoire of stimulus and response patterns. 


Selective Attention. Development of the ability to react to recur- 
ring stimulus compounds as unitary patterns and the integration of 
recurring response sequences into higher-order units, important 
though they are, would not suffice to account for the efficiency of the 
mature individual in learning, often on a one-trial basis, relations in- 
volving new combinations of properties or events as, for example, the 
names of newly encountered people or the identifications of new plants 
or animals. An important factor in the development of this ability is the 
acquisition of habits of selective attention with respect to the stimulus 
properties which differentiate the members of important classes of
objects or events. Thus, what is termed “reading readiness” involves 
acquisition of the habit of distinguishing printed black-and-white pat- 
terns such as letters on the basis of contours, while ignoring differ- 
ences in size of the retinal image or position in the visual field. 

Beyond facilitating the learning of particular discriminations, the 
development of habits of selective attention for the stimulus properties 
which are relevant over whole classes of tasks or situations prepares 
the way for the emergence of organization in identification and recall. 
A relatively naive or immature individual, in learning to identify
members of a class of objects, must begin with an essentially random
process. On early trials random subsets of the possible associations
between each object encountered and the proper identifying response
may be formed. Since, in general, the objects in the class will share
many stimulus properties or components, the associations thus formed
will not suffice to permit identification without confusion. Thus, on
subsequent trials the individual must sample additional elements, then
rehearse and consolidate additional associations until enough have
accumulated to provide a basis for reliable discrimination of all objects
in the class.

At a later state of development, the individual may have learned by 
experience which stimulus dimensions are relevant and which are ir- 
relevant for distinguishing members of a particular class of objects 
(Bower, 1967). Then upon encountering each new instance he need 
only note the “values” of the given object on a limited number of stim- 
ulus dimensions, and by utilizing associations involving these can 
immediately establish the basis for efficient identification. Thus, a
child who has grown up in a western European culture, upon
encountering a group of people from an Asiatic country, finds it very
difficult to associate names with faces and complains that these people “all
look alike.” In this complaint, the child is expressing the fact that the
members of the newly encountered group resemble one another on a
number of the important stimulus dimensions (for example, hair and
eye color) which he is accustomed to using as a basis for identification
and recall.


Observational Learning. Although detailed experimental analyses
are lacking, it seems clear on theoretical grounds that combinations
of these developmental processes that have been separately
identified may generate at least part of the higher-order learning
phenomena which characterize the normal human being beyond the
earliest developmental stages and which appear superficially to be
qualitatively different from forms of learning observed in the young
child, in the mentally retarded, and in animals. An important case in
point is the ability of the normal human being to profit extensively from
the experience of others. The efficiency with which older normal
children and adults avoid most of the dangers in their environments
cannot be explained in terms of escape and avoidance conditioning, for
most of the potentially traumatic events are never experienced.
Similarly, much constructive, goal-directed activity cannot be accounted for
in terms of simple reinforcement principles, because the rewarding
consequences toward which the behavior appears to be directed have
never previously been experienced by the particular individual.
An important factor in this type of behavior is undoubtedly the
learning of associations between behaviors and rewarding or punishing
consequences by observation of other individuals, which Bandura
(1965) terms “learning from models.” Learning from models would be 
difficult to account for within the framework of the traditional law of 
effect or reinforcement theory, in which the basic conditions of
behavioral modification are held to be selective strengthening or weakening,
as a result of immediately following rewards or punishments, of re- 
sponses made by the individual. Within the theory outlined in the pre- 
ceding section, however, the role of rewards and punishments in 
learning is always indirect in the sense that the organism must first 
learn relationships between behavioral events and rewarding or pun- 
ishing consequences, and then modify its process of response selection 
on the basis of anticipated outcomes. 

The first stage in the development of the ability to learn from models
must be the process of stimulus patterning we have already discussed.
The organism must learn to recognize environmental events which are
sources of reward or punishment regardless of whether these occur as
a consequence of its own actions. Once this learning has occurred, the
organism will be in a position to learn relationships between these
events and others simply by perceiving them in temporal contiguity.
Clearly this process can proceed without verbal mediation; for
example, John, Chesler, Bartlett, and Victor (1968) gave a clear
experimental demonstration of virtually instantaneous learning by mature
cats to press a bar for food following the opportunity to observe other
cats pressing the bar which activated the food-dispensing mechanism.
The cats which apparently learned by observation did not, it should be
emphasized, imitate the precise movements made by the “models”;
rather they behaved as though they had learned to anticipate that
movement of the bar would be followed by the appearance of food in
the dispenser and thus, when permitted to do so, modified their own
behavior by selecting responses which brought them into contact with
the bar and resulted in its movement, even though these were not the
same response patterns exhibited by the “models.”

The researches of Bandura and his associates have begun to
generate an account of the way in which this important form of
higher-order learning develops in the normal child (Bandura, 1965; Bandura,
Grusec, & Menlove, 1967). However, I have seen no reports of studies
of this process in specific relation to mental retardation; if this area
has indeed been overlooked in research to date, it seems almost
certain that it would repay serious investigation. It seems almost
inevitable that one of the respects in which the environment in custodial
institutions must differ most drastically from that in normal homes and
schools is with respect to opportunities for constructive learning from
models.


Development of Motivational Systems. There is no reason to doubt that
the motivational systems and emotional dispositions which contribute to
normal adult behavior are the result of as prolonged and intricate a
developmental process as learning and problem-solving abilities.
However, experimental analysis lags, owing to greater difficulties of
measurement and control. Consequently any interpretation of the
development of motives in terms of learning theory must at present be
somewhat speculative and fragmentary. It is easy enough to contrast
the behavior of the very young child, closely bound to the effects of
immediate reward and punishment, with the extended and organized
behavior systems of the adult which are often only remotely and
tenuously related to rewarding or punishing consequences. But at
present it is difficult to do much toward filling in the gaps in the
developmental sequence leading from the former to the latter. We do
know that organized motivational and emotional systems do not emerge
from an original condition of randomness. Even in the very young
organism, reaction tendencies associated with various unconditioned
stimuli show clear evidence of organization into systems. The taste of
food by the hungry infant leads to facilitatory feedback which
increases the probabilities of a considerable range of approach and
ingestive behaviors, whereas a painful stimulus produces inhibition of
positive systems and facilitation of the families of defense and
withdrawal reactions. By virtue of the associative process discussed in
the previous section, the events which precede these unconditioned
rewarding and punishing stimuli become assimilated into the
motivational systems: approach behaviors come to be facilitated by
stimuli that have preceded food, withdrawal by stimuli which have
preceded pain.

The balance of reward and punishment in an individual's environment is
doubtless of the greatest moment with regard to the course of
behavioral development. An ample supply of rewarding events must be
available if the young organism's tendencies to explore its environment
are to be strengthened and lead to the acquisition of a normal
repertoire of behavioral sequences leading to various types of
incentives and goals. Probably almost equally important for normal
motivational development is an environment relatively free of
punishment. According to the theory of punishment which seems to me
best established (Estes, 1967, 1969) and which has some rather cogent
physiological as well as behavioral support (Miller, 1963; Stein,
19[..]), an important consequence of the traumatic or painful event is
that it leads to immediate inhibition of currently active positive
drive systems. As a result of conditioning, events which precede
punishment would thus in turn come to produce inhibition of positive
drive systems. Consequently, extensive experience with punishment would
have the result of crippling the development of positive motivational
systems. I hasten to add, however, that this observation should not be
taken too broadly as a prescription against the use of punishment for
controlling behavior of children, retarded or otherwise. The implication
is, rather, that the use of punishment should always be combined with,
or, to put it differently, compensated for, by reward for more desirable
alternative responses, as, for example, in the work of Lovaas,
Schaeffer, and Simmons (1965) with psychotic and autistic children,
where even rather severe punishment may not have obvious effects in
retarding the development of positive motivational systems if combined
with lavish administrations of rewards.

On the basis of learning theory it must be assumed that the development
of more generalized “achievement motives” (McClelland, Atkinson, Clark,
& Lowell, 1953) depends upon occurrence of appropriate sequences of
success experiences in earlier life, owing either to good fortune or to
good management on the part of parents or educators. Thus, to take a
relatively simple example, in the case of a boy who has had a sequence
of successes in solving progressively more difficult arithmetic
problems, with consequent indications of approval and esteem from
teachers and parents (perhaps quite in contrast to the results of his
efforts in other school subjects), [...] problems and the experience of
intellectual effort [...] followed by [...] continued effort comes to
function as a signal for anticipated reward rather than for continued
failure.


A major overall trend in the development of [...] modes of adult
learning and performance is the progressive [...] processes which are
originally overt. At a [...] level, this trend is observed in the
development of [...] even in animals. The simplest discriminations
involving differential response to stimuli which have elements in
common cannot be solved if the organism responds overtly to a given
stimulus display without first inspecting all of the elements or
component features. If an overt response is evoked by the first cues
sampled, sometimes these will be the elements common to the stimuli
being discriminated. Over a series of trials in almost any
discrimination task premature responses lead to decreases in frequency
of reward, and the organism learns to slow down at choice points and
expose itself to the relevant cues before making an overt choice.

In the normally developed human being, this type of learning
generalizes to new situations. Ultimately the individual tends, when
any new problem situation presents itself, to withhold overt response
until the alternative response sequences which might be initiated have
been examined covertly and the one selected which generates the highest
expectation of a rewarding outcome. This process of covert scanning of
alternatives prior to choice is evidently conspicuously lacking in the
mentally retarded, who are commonly characterized by such terms as
“impulsive” and “distractible.” The sources of this type of deficit
are, however, little understood and doubtless merit further
investigation.

It seems to be a rather common assumption that there may be some
relatively localized organic basis for the inability of the retarded
individual to inhibit premature responses, or to resist distraction by
irrelevant stimuli during the period of choice. However, one cannot at
present rule out the possibility that these deficits are largely a
result of an absence of the necessary conditions for the learning of
habits of stimulus scanning and “vicarious trial and error.” On
theoretical grounds we cannot expect habits of this type to generalize
to broad classes of situations until they have first been learned in
relatively specific contexts. And meeting this condition must require
experience with sequences of problems of appropriately graded
difficulty which permit individuals to learn by experience that some
modes of responding lead to substantial increases in the likelihood of
reward. If discriminations encountered during the early developmental
stage are typically too difficult, so that attempts at solution lead to
no detectable increases in reward input, we must expect that the
general response tendencies learned will be those which most quickly
terminate exposure and lead to escape from the frustrating situation.


Part B 
Empirically Oriented Research 


IV. Heritability and Development of Learning Ability 


My main purpose in this study is to analyze the role of learning 
theory in research on mental development, not to survey the empirical 
literature. In this section I have attempted only to characterize the 
empirical side of the interaction between theory and experiment, the 
relationship which is the main focus of attention in the remainder of 
the volume. From the very large body of research reports considered, I
have selected those which seem directly relevant to this relationship. 
Where appropriate I have discussed the adequacy of methodology and 
the interpretability of findings in terms of the presently available body 
of learning theory summarized in Chapter J. The empirical findings 
cited in this chapter will give the reader some perspective with
respect to the body of experimental facts presently available as a
basis for theoretical development.


1. genetic studies


In view of the well-documented contribution of inheritance to
individual differences in intelligence (see Hilgard & Atkinson, 1967,
for an introduction to the extensive literature), surely few
psychologists doubt that variation in learning ability, within as well
as between species, is in part genetically determined. However,
positive support for this belief is based largely upon general
biological principles rather than upon specific research findings. The
relatively new discipline of behavior genetics has advanced rapidly
during the last two decades (Fuller & Thompson, 1960; McClearn &
Meredith, 1966) but still appears considerably short of being able to
shed light on the heritability of specific learning capacities.

Much of the earlier research in this area has gone into demonstrations
that strains of animals which differ markedly with regard to
performance in standard learning situations can be generated by
selective breeding. Not surprisingly, work relating behavioral
differences to chromosomal variables has, so far, been limited to
studies of lower organisms, principally Drosophila (Hirsch &
Erlenmeyer-Kimling, 1967). Large differences between strains of rats in
rate of maze learning (Heron, 1935, 1941; Tryon, 1940) or between
strains of dogs in rate of conditioning (Fuller, 1955) are readily
produced by selective breeding, but scarcely a start has been made
toward analyzing the behavioral differences between strains in terms of
specific learning or motivational processes.

The first attempts to relate differences between maze-bright and
maze-dull strains of rats to specific, theoretically significant
aspects of learning were not very fruitful, in part because of the
primitive state of the associated genetic analysis (Estes, 1942; Heron
& Skinner, 1940). Recent work with highly inbred strains of mice
appears rather more promising. For example, Bovet, Bovet-Nitti, and
Oliverio (1969) report striking differences between strains with
respect to effects of temporal variables upon avoidance conditioning
and retention of avoidance responses and suggest an interpretation in
terms of a two-stage conception of memory storage. Although one still
can find few points of direct interaction between learning theory and
behavioral genetics, there are grounds for expecting a more fruitful
relationship between these disciplines to develop.


2. developmental studies 


Presumably information relevant to the maturation of elementary
learning capacities must come largely from analyses of simple forms of
learning in children at the earliest stages of development, perhaps
primarily during the first year of life. However, research on learning
during this period is beset with difficulties and most of the effort to
date has gone into developing appropriate methods.

Even the fragmentary available evidence, based on experiments
conducted with differing procedures at different age levels, leaves
little doubt about the gross empirical fact that conditioning is very
difficult to demonstrate during the first few weeks of life. Studies of
classical aversive conditioning in infants 1-3 weeks old have yielded
conflicting and tenuous results, and “it must certainly be concluded
that such conditioning is at best difficult to obtain” (Lipsitt, 1963,
p. 162). Evidence for classical appetitive conditioning in infants of
this age range is somewhat firmer, positive results having been
reported by Irwin (1930), Marquis (1931), and Lipsitt and Kaye
(reported in Lipsitt, 1963), among others. Simple operant conditioning
has been demonstrated in infants 3-4 months of age by Brackbill (1958)
and Weisberg (1963), learning being manifest in the former case by
“returning a smile” and in the latter by changes in vocalization as a
result of “social” reinforcement. An operant discrimination between red
and blue stimuli by 12-month-old infants has been reported by Simmons
(1964). In no case do there seem to be available any data based on
fully comparable conditioning experiments conducted on infants at a
sequence of age levels. A review by Lipsitt (1963) indicates that,
although empirical results consist largely in demonstrations that
conditioning and other simple forms of learning can be produced in
infants, advances in methodology are increasing the prospects for more
analytical work.
Even at the animal level there have been few systematic studies
permitting well-controlled comparisons of learning functions on the
same tasks at different ages. Reviewing a series of studies of subhuman
primates, Harlow (1959a) reports that for monkeys, classical
conditioning was as rapid at 5 days of age as in adulthood. Rate of
object-discrimination learning, on the other hand, increased
substantially with age. Trials to criterion decreased from about 200 at
30 days to an asymptotic level of about 10 at 150 days in comparisons
involving animals first exposed to the experimental situation at the
given age. The younger animals persisted longer than adults in object
or position preferences and shifted more often from one to another
incorrect hypothesis.

It would be of prime interest from a comparative standpoint to
determine whether similar trends obtain for human children. The most
relevant study that has come to my attention was reported by Hill
(1965). Groups of children were first tested on object-discrimination,
oddity, and conditional-oddity learning at ages 1, 4, 6, and 12 years.
The “50% thresholds,” that is, the ages at which 50% of children
tested were able to meet a learning criterion, were found to be at 1
year for object discrimination, at 6 years for oddity problems, and 6-12
years for conditional oddity. At all ages, children mastered any given
discrimination within about 40 trials if at all. Backward learning curves
were similar in form to those reported by Zeaman and House (1963) for
discrimination learning by retardates: initial plateaus at a chance level
of responding terminated in abrupt transitions to criterion
performance. The hypothesis which immediately comes to mind is that
children whose performance curves did not depart from the initial
plateau on any given type of problem were, like the immature monkeys,
not responding randomly but rather persisting in position or stimulus
preferences which were incompatible with solution.
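
A backward learning curve of the kind mentioned above is built by
aligning each subject's trial record on a common endpoint and averaging
performance at each position counted back from that point, rather than
forward from the first trial. The sketch below is only an illustration
under stated assumptions: the alignment rule (the trial of the last
error), the function names, and the four trial records are all
hypothetical and are not taken from Hill (1965) or Zeaman and House
(1963).

```python
def last_error_index(responses):
    """Index of the final error in a record (1 = correct, 0 = error).

    Returns None when the record contains no errors.
    """
    for i in range(len(responses) - 1, -1, -1):
        if responses[i] == 0:
            return i
    return None


def backward_curve(records, span):
    """Proportion correct at span, ..., 1 trials before each last error.

    Subjects whose records are too short to supply a given position
    are simply left out of the average for that position.
    """
    curve = []
    for back in range(span, 0, -1):
        scores = []
        for rec in records:
            idx = last_error_index(rec)
            if idx is not None and idx - back >= 0:
                scores.append(rec[idx - back])
        curve.append(sum(scores) / len(scores) if scores else None)
    return curve


# Hypothetical two-choice records; each subject solves after a
# different number of trials, so forward averaging would blur the
# transition that backward alignment preserves.
records = [
    [1, 0, 0, 1, 0, 1, 1, 1, 1, 1],
    [0, 1, 1, 0, 1, 0, 0, 1, 1, 1, 1, 1],
    [1, 0, 1, 1, 1, 1, 1],
    [0, 1, 0, 0, 0, 1, 1, 1, 1, 1],
]
curve = backward_curve(records, span=4)
print(curve)
```

With records like these, the averaged pre-solution positions stay near
the chance level for a two-choice task, which is the plateau that the
text describes as ending in an abrupt transition.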

A bit of information possibly bearing on this hypothesis is to be
found in a developmental study of color-form discrimination reported
by Milgram and Furth (1964). Groups of 6- and 9-year-old normal
children and retarded groups matched to them for MA were given training
on original discriminations followed by various shifts, of which the
positional reversals and dimensional reversals are of particular
interest. During original learning, the correct cue on the relevant
dimension, either color or form, always appeared in the same position.
On positional reversal, the color or form dimension became irrelevant
and the correct position was changed. On dimensional reversal, the
position assignment remained the same, but correct and incorrect cues
on the relevant dimension were interchanged, so that performance should
be susceptible to substantial facilitation by attentional or mediating
responses. In terms of errors to criterion, clear relationships to age
and normal-retarded differences within age groups were exhibited only
on dimensional reversal.


Similarly, in the case of probability learning, the progressive
increase with age in rate of approach to a probability-matching level
of performance has been shown to be attributable at least in part to
the greater tendency of younger children to persist in systematic, but
inappropriate response patterns, such as alternation (Craig & Myers,
1963; Stevenson & Odom, 1964). In some variants of the
probability-learning experiment, however, alternation tendencies have
been found to increase with age to a maximum and then decrease,
suggesting that the shifts in response patterns may reflect the
learning of habits of approach to specific types of problems rather
than a simple maturational process (Weir, 1964).


[...] now available to interpret [...] learning, it seems that well-[...]
paired-associate learning, [...] at different ages, [...] the
development of verbal [...] of the literature on verbal [...] relevant
studies (Goulet, 1968; [...] free [...] might sub[...] years to college
level, and this function [...] a measure of clustering in recall
(Bousfield, [...] 1958; Cole, Frankel, & Sharp, 1970, in press).
Further, the tendency to organize recall by means of clustering has
been [...] little over trials within an experiment in the case of
5-10-year-old children but to increase steeply in the case of college
students and older adults (Laurence, 1966).

A recent study by Moely, Olson, Halwes, and Flavell (1969) yields
evidence that the child’s ability to utilize clustering when appropriately 
instructed and his tendency to cluster spontaneously follow quite dif- 
ferent developmental trends. Children in kindergarten and first grade 
exhibited little spontaneous clustering and increased their clustering 
activity only in response to specific instructions, whereas older chil- 
dren required little prompting. However, at all ages clustering, once 
initiated, improved recall. Cole, Frankel, and Sharp (1970) report that 
increases in recall as functions both of age and of amount of clustering 
are largely confined to the early and middle portions of the recall list, 
as would be expected if the increases reflect more effective strategies 
for retrieval from long-term memory. Evidently the spontaneous use of 
rather general strategies for improving retention tends to increase as a 
function of age, but little is yet known as to the specific kinds of
experiences which are responsible. A promising direction for further
research is suggested by the success of Kingsley and Hagen (1969) in
producing shifts toward the adult pattern when they gave young children
specific training on retrieval skills for their serial-order memory
task.

Age-related trends in retention of nonverbal material may be of more 
relevance for the analysis of maturational processes, but even here it is 
difficult to partial out the role of verbal mediation. To take one exam- 
ple, Paraskevopoulos (1968a, b) has conducted a rather extensive de- 
velopmental study of short-term memory for dot patterns in retarded, 
normal, and gifted children. Age groups ranged from 6-7 to 11-12 
years for the gifted and from 8-9 to 15-16 years for the retarded. Each 
group was tested on random dot patterns and on patterns character- 
ized by bilateral, horizontal, or double symmetry. For each condition, 
retention scores, in terms of mean errors in reconstructing patterns 
from memory, declined as an orderly function of age. The pattern of 
relationships among the separate curves for patterns with different 
types of symmetry was very similar for the retarded and gifted groups, 
but for the gifted group the rates of decline of the age functions were greater and
the apparent asymptotes lower across the board. It is difficult to judge, 
however, the extent to which these trends can be taken to support the 
author’s interpretation in terms of development of perceptual organiza- 
tion, for patterns characterized by different types of symmetry are 
amenable in differing degrees to verbal recoding and rehearsal.

More nearly direct evidence concerning the operation of linguistic
habits in producing age-related changes in retention comes from a
study by Flavell, Beach, and Chinsky (1966). Records of children’s
spontaneous verbalizations showed an increase from age 6 to age 11 in
likelihood of rehearsing stimulus names in a nonverbal serial recall
test. Further, within the middle age group (the only one for which the
breakdown was feasible) verbalization was directly related to retention
on both immediate and delayed recall tests.

Theoretically relevant data concerning developmental changes in
learning processes have been forthcoming largely for the period from
ages 4-7, the first few years of school. The relatively heavy
concentration of effort upon this period is understandable since it is
during these years of childhood that the individual's behavior comes to
a marked extent under the control of language and therefore that modes
of learning in the child begin to depart markedly from those found in
the lower animals.

One of the earliest major studies of discrimination learning by
children in this age range demonstrated an orderly increase in the
tendency to transpose stimulus relationships (Kuenne, 1946). In another
well-known series of studies, the Kendlers (Kendler and Kendler, 1962;
Kendler, 1962) found a change during this period from faster
nonreversal shifts in younger children to faster reversal shifts in
older children following similar original discrimination training.
Also, they found a progressive increase over the age range 3-10 years
in the proportion of children who reversed a previously learned
discrimination when the second task permitted a choice between a
reversal and a nonreversal shift.

These types of findings have been taken to support the idea that
within this age range the child comes increasingly to verbalize
stimulus relationships and that verbal mediating responses facilitate
the transfer of discrimination learning to new situations. A similar
interpretation may be applicable to White’s (1966) findings that the
effects [...] of discrimination learning shift from negative to neutral
or positive during the age range of about 5-7 years.

Summarizing the research literature on learning in children between
5 and 7 years of age, White finds considerable evidence for three
rather general developmental trends: (a) increasing use of linguistic
mediating responses, [...], and the like, as aids to learning and
performance; (b) increasing ability to maintain orientation toward
“invariant dimensions of stimuli”; (c) increasing ability to “string
together representations of stimulus-response-consequence into
sequences which, projected into the future, allow planning, and
projected into the past, allow inference” (1965a, p. 210).

On the whole, it appears that learning in children prior to about age
5 proceeds in similar fashion to the learning observed in comparable
experiments with animals. Except perhaps for some aspects of linguistic
development, this early learning is directly interpretable in terms of
associative principles. But, although the character of human learning
changes notably beyond age 6 or 7, with steeply increasing control of
behavior by verbal rules and strategies, it need not be assumed that
associative processes cease to operate.

White (1965a, b) has assembled a number of lines of empirical support
for the conception that a hierarchical organization of learning
processes develops with age. The older child, as the adult, continues to
learn in accord with associative principles, but also comes to inhibit
responses arising directly from stimulus-response associations when
these conflict with higher-order, cognitive processes. A novel source of
evidence which White (1965b) brings to bear upon this hypothesis is
the relationship between correctness of response and time taken to
respond in discrimination learning. For the youngest children tested
(age 34-44) the lowest latency responses are most likely to be correct,
and the curve of proportion correct versus latency increases
monotonically, being apparently asymptotic beyond 2 sec. With increasing
age, this function takes on an inverted S shape. For older children (age
1) and for adults on stimulus generalization tests, probability that
a response is correct decreases as a function of latency to a minimum
at .25-.50 sec, increases to a maximum at .80 to 1.0 sec, then
decreases at longer latencies.

White interprets this latter function in terms of a “temporal
stacking” of alternative responses which may be initiated by a given
stimulus. In the course of learning, the older child or adult comes to
inhibit his initial response tendencies to stimulus presentations and
performs in accord with verbal rules or other higher-order modes of
response. On some occasions, especially under stress, the higher-order
processes become temporarily inoperative, leading to the disinhibition
of incorrect stimulus-response associations.



V. Empirical Comparisons between Normal and Retarded 
Populations 


The vast majority of studies of learning in the retarded up to the
present, and virtually all of those done before the first major reviews of
the literature appeared in the early 1960s (e.g., Denny, 1964), were
concerned with comparisons between groups of normal and retarded
subjects in various standard laboratory situations. The groups
undergoing comparisons have ordinarily been defined by performance on
standard intelligence tests, but occasionally by grosser criteria such as
success or failure in school, or ability to get along outside a custodial
institution. With the exception of a few current research programs,
which will be discussed in detail in later sections, these researches
have not been guided by analyses of the problems of retardation in
terms of learning theories. The underlying assumption, never fully
explicated, seems to have been that these group comparisons might
somehow aid in localizing the sources or types of deficit in intellectual
functioning that characterize the mentally retarded.


1. methodology 


Within the framework of contemporary learning theory, how can we
characterize what is being done when intellectual ability is appraised
by means of standardized tests? Samples of performance are taken on
a variety of tasks, many of which would be expected to involve learning
explicitly (for example, tests of general information, vocabulary,
arithmetic reasoning) but usually with some admixture of tasks that are
assumed to be relatively free of effects of specific previous learning
experiences. Since the previous learning has gone on outside the
testing situation, we are sampling performance at particular points on
learning curves, generally with no knowledge of the form of the
learning function and with no way of identifying the points on the
abscissas at which samples are being drawn. We know that in virtually
all of the learning situations that have been studied in the
laboratory, individual differences appear in initial levels of
performance, terminal levels of performance, and rates of approach to
asymptotic levels (see Gagné, 1967, for reviews of the literature
relevant to this generalization). This ubiquitous variation in
parameters of empirical learning functions is almost universally
assumed to be partly attributable to previous experience and partly to
organismic variables which may in part reflect innate structural
differences among individuals.
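
The difficulty of sampling performance at unknown points on learning
curves can be illustrated numerically. The sketch below assumes, purely
for illustration, a negatively accelerated (exponential) learning
function with an initial level, an asymptote, and a rate parameter;
neither this functional form nor the parameter values come from the
text, and the two learners are hypothetical. It shows that learners with
different rates can yield the same score at the moment of testing when
their amounts of prior practice differ, so that a single sample cannot
separate rate from position on the abscissa.

```python
import math


def performance(trials, initial, asymptote, rate):
    """Assumed exponential curve: performance after `trials` of practice."""
    return asymptote - (asymptote - initial) * math.exp(-rate * trials)


def trials_needed(score, initial, asymptote, rate):
    """Invert the assumed curve: practice required to reach `score`."""
    return -math.log((asymptote - score) / (asymptote - initial)) / rate


# Hypothetical learners with the same initial level and asymptote
# but different rates of approach to the asymptote.
FAST = dict(initial=0.2, asymptote=1.0, rate=0.10)
SLOW = dict(initial=0.2, asymptote=1.0, rate=0.05)

# Both learners obtain the same score on the day of testing.
observed_score = 0.6
fast_practice = trials_needed(observed_score, **FAST)
slow_practice = trials_needed(observed_score, **SLOW)

# Gain produced by the same amount of further practice on the task.
extra = 10
fast_gain = performance(fast_practice + extra, **FAST) - observed_score
slow_gain = performance(slow_practice + extra, **SLOW) - observed_score

print(f"prior practice: fast={fast_practice:.2f}, slow={slow_practice:.2f}")
print(f"gain from {extra} more trials: fast={fast_gain:.3f}, slow={slow_gain:.3f}")
```

Under these assumed parameters the slower learner needs twice as much
prior practice to reach the common score, and the faster learner gains
more from the same additional practice, even though the test score alone
does not distinguish them.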
Why should test scores based on these samples of behavior in very
limited situations predict performance in school and in everyday life?
The methods of test construction insure this result by arranging that,
to a considerable extent, the tests sample what the individual has
learned up to the age of testing. To the extent that the sampling is
adequate, an individual's standing in a group with respect to the skills
and knowledge represented in the test must, statistically, predict his
relative standing with respect to the larger population of skills and
knowledge being sampled. Further, if different individuals tested at a
given age have had the same opportunities to learn, test performance
must in part reflect parameters representing rates of gain, that is, the
extent to which these individuals have profited from the same amount
of previous experience, and thus should predict rates of gain from
experience in new situations. If, however, opportunities to learn differ
among these individuals, then, of those receiving the same test scores
at a given age, the ones having had fewer previous opportunities to
learn must have higher rates of gain and thus should be expected to
gain more from the same amount of experience on new tasks.

Within this framework, what is being done when normal and retarded
individuals are compared on specific laboratory learning measures?
What could one hope to accomplish from such comparisons?
What controls should be employed? When we define groups of normal
and retarded learners on the basis of test scores and compare these
groups on a variety of laboratory tasks, we obtain an alternative
description of differences in performance between these groups. What
might we hope to gain by replacing a description of differences between
the groups in terms of subtests of the Stanford-Binet scale or the like
with a description in terms of differences in performance in eyelid
conditioning, maze learning, paired-associate acquisition, and similar
laboratory learning tasks?

It seems likely that one main source of motivation for these studies
is similar to that of the investigators of mental abilities who have
attempted to reduce the concept of general intelligence to one of
variation in a small number of traits or factors (e.g., Spearman, 1927;
Thurstone, 1938). The analogous objective here might be to reduce the
description of differences in mental ability now given by scores on
heterogeneous tests to description in terms of differences revealed by
laboratory experiments in a small number of presumably more unitary
and basic functions or capacities (for instance, conditionability,
sus[...] decay rate of stimulus traces).

[...] relatively ambitious objective should prove [...]cription of
mental [...] anticipate the conclusions [...] that progress in this [...]
fruit to be anticipated [...] and normal individuals [...] up to the time
of testing. [...] this purpose because dura[...]ls of weeks or months as
opposed to the single brief [...] involved in most learning studies to
date.

Secondly, major attention is needed [...]ments of mental development,
[...] therefore in the rates of improvement [...] expected on new [...]
be inferred from performance at [...] time only if previous
opportunities to learn have been equated for individ[...]. It must be
emphasized further that control is required not only for previous
opportunities to learn the specific mate[...] test, but also for
previous opportunities to learn other habits or skills or
to develop motives or emotional dispositions which might influence
performance at the time of testing.
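
The reasoning above, that a test score reveals a rate of gain only when previous opportunities to learn are known or equated, can be illustrated with a minimal sketch. The linear relation (score = rate x opportunity), the class name, and all numerical values are assumptions introduced here purely for illustration; the text itself commits to no particular functional form.

```python
from dataclasses import dataclass


@dataclass
class Learner:
    name: str
    rate: float         # hypothetical rate of gain per unit of experience
    opportunity: float  # hypothetical amount of previous opportunity to learn

    def score(self) -> float:
        # Toy assumption: attainment grows linearly with experience.
        return self.rate * self.opportunity


def inferred_rate(score: float, opportunity: float) -> float:
    """Recover the rate parameter from a score when opportunity is known."""
    return score / opportunity


# Case 1: equal opportunities, so scores order individuals by rate of gain.
a = Learner("A", rate=2.0, opportunity=10.0)
b = Learner("B", rate=1.0, opportunity=10.0)
assert a.score() > b.score()

# Case 2: equal scores but unequal opportunities.
c = Learner("C", rate=2.0, opportunity=5.0)
d = Learner("D", rate=1.0, opportunity=10.0)
assert c.score() == d.score()

# The learner with fewer previous opportunities must have the higher rate,
# and so is expected to gain more from the same amount of new experience.
new_experience = 3.0
gain_c = inferred_rate(c.score(), c.opportunity) * new_experience
gain_d = inferred_rate(d.score(), d.opportunity) * new_experience
print(gain_c, gain_d)
```

Under these assumed values the two learners in the second case are indistinguishable by score alone; only the opportunity term separates them, which is the sense in which uncontrolled differences in previous experience leave the comparison uninterpretable.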

Lack of regard for this point in the literature on mental retardation is 
hard for the outsider to credit. It is not uncommon to see comparisons 
of performance on laboratory tasks between groups differing as radi- 
cally with regard to previous opportunities to learn as institutionalized 
retardates and college students of similar chronological age. Under 
such circumstances differences in performance on the reference task 
must be expected to reflect some unknown mixture of effects of differ- 
ences in past opportunities to learn and in the type of subject variable 
associated with the notion of capacity (not to speak of other determi- 
ners of performance such as motivation and specific skills in dealing 
with test situations). 

On theoretical grounds I can see no way to avoid, and little room to 
ameliorate, the conclusion that when differences in previous opportuni- 
ties to learn and the relationships of these to the reference task are 
unknown, comparisons between different groups of individuals are of 
little value for any theoretical purpose. Investigators in this area would 
doubtless reply that it is difficult in practice to equate groups of sub- 
jects on variables other than test scores or chronological age. This 
difficulty is surely amply appreciated by all who have had any famil- 
iarity with this research area. Nonetheless, the fact that better controls 
will be difficult to achieve adds nothing to the value of uninterpretable 
data. I am inclined to press this point strongly, for it seems generally to 
be glossed over, perhaps out of sympathy with the many difficulties of 
working with retarded groups. 

Granted that the ultimate desideratum of perfect matching of groups 
on the basis of previous opportunities to learn is impossible to achieve, 
much more can and must be achieved with regard to approximating it. 
Even at the cost of reducing the number of experiments being con- 
ducted, those which are done with a view to comparing normal and 
retarded populations could be expanded to include multiple normal 
control groups differing in the degree to which they match the retarded 
group with respect to variables related to previous learning opportuni- 
ties. By this means a beginning could be made toward separating the 
contributions of opportunities to learn and individual differences in 
capacity with respect to learning situations in which these sources of 
variation are now inextricably confounded. 

A third principal methodological question concerns the selection of
normal control groups to be compared with a given group of retarded
subjects. Comparison groups equated on CA and on MA appear somewhat
unsystematically in the literature. To my knowledge, the issues
involved were first discussed explicitly by Denny (1964). Upon
theoretical analysis, the following considerations emerge.

If previous opportunities to learn are in fact equated for the groups
of subjects being compared, then among groups having the same CA, those
with higher IQs will differ with respect to parameters having to do with
rate of learning as well as with respect to present levels of the skills
and knowledge sampled on the test. This is so, since the higher-IQ groups
would have profited more from equal opportunities to learn over the
same period of time. One might expect, then, by means of these
comparisons, to localize the differences in learning rates with respect,
for example, to particular types of tasks or intellectual functions, and
to investigate conditions that might modify the differential learning
rates in laboratory learning situations.

If, given the same premise, different groups are equated with respect
to MA scores, thus differing with respect to CA, the higher IQ,
and therefore younger, groups will have gained more from the same
opportunities to learn and thus must rank higher on capacity variables.
Therefore, of the groups matched on MA, those higher in IQ would be
predicted to gain more from new experiences, presumably including
opportunities to learn in laboratory situations. However, in attempting
to pursue this strategy, [...] opportunities to learn cannot rea[...]
at least until very [...]e for opportunities to [...] ubiquitous,
circumstance, the [...] but differing in IQ will refle[...]

Thus comparisons of learning rates and other aspects of performance
in classical conditioning and other simplified and “artificial” learning 
situations which differ radically from those to which subjects would 
have been exposed previously in everyday life would appear to provide 
the most promising starting point for analyzing differences in capacity 
variables between normal and retarded populations. 

Again, comparisons between groups matched on MA but differing in 
IQ would be still harder to interpret. The high-IQ individuals would
have reached the common MA at an earlier CA owing to some mixture 
of higher capacities and richer opportunities to learn. When the former 
aspect is dominant, the higher-IQ subjects of a given MA should do 
better on new tasks, since they would tend to have the higher learning 
rates. But when the latter aspect is dominant, the lower-IQ subjects, 
having reached the same level of performance with fewer opportunities 
to learn, must have the higher learning rates and should be expected to 
do better on new tasks. Thus, at the present stage of knowledge in this 
area, equal MA comparisons would seem to be almost uninterpretable 
in the vast majority of learning situations. The obvious justification for 
using both equal CA and equal MA comparison groups is that the pat- 
tern of differences on different kinds of tasks may shed some light on 
the probable roles of previous experience versus capacity parame- 
ters—and this much is apt to be achieved only if more attention is 
given to varying the composition of comparison groups systematically 
with respect to average standing on variables related to previous
opportunity to learn.
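
The ambiguity of equal-MA comparisons described above can be made concrete with a small numerical sketch. It assumes the classical ratio definition of IQ (100 x MA / CA) and, purely for illustration, a toy relation MA = rate x opportunity per year x CA; the function names and the figures chosen are hypothetical and are not drawn from any study cited here.

```python
def ratio_iq(ma: float, ca: float) -> float:
    """Classical ratio IQ: 100 * mental age / chronological age."""
    return 100.0 * ma / ca


def implied_rate(ma: float, ca: float, opportunity_per_year: float) -> float:
    """Toy assumption: MA = rate * opportunity_per_year * CA, solved for rate."""
    return ma / (opportunity_per_year * ca)


# Two groups matched on MA but differing in CA, and hence in IQ.
ma = 8.0
high_iq_ca, low_iq_ca = 8.0, 12.0
high_iq = ratio_iq(ma, high_iq_ca)
low_iq = ratio_iq(ma, low_iq_ca)

# Scenario 1: opportunities to learn were equal, so the higher-IQ group
# must have the higher learning rate.
r_high_1 = implied_rate(ma, high_iq_ca, opportunity_per_year=1.0)
r_low_1 = implied_rate(ma, low_iq_ca, opportunity_per_year=1.0)

# Scenario 2: the lower-IQ group had much poorer opportunities, so it
# reached the same MA with less experience and has the higher rate.
r_high_2 = implied_rate(ma, high_iq_ca, opportunity_per_year=1.0)
r_low_2 = implied_rate(ma, low_iq_ca, opportunity_per_year=0.5)

print(high_iq, round(low_iq, 1))
print(r_high_1 > r_low_1, r_low_2 > r_high_2)
```

The same pair of MA-matched groups thus yields opposite predictions about performance on new tasks, depending entirely on the unobserved opportunity term.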

A third possible fruit of normal-retardate comparisons in the
learning laboratory is the isolation of differences between these
populations on variables which are significant within well-developed
theories, thus promoting the interpretation of the differences in
biological or neurophysiological terms on the one hand, and facilitating
the search for means of preventing or modifying retarded mental
development on the other. Almost by definition, progress in this
direction is unlikely to arise from empirically directed, comparative
studies. Thus it is not surprising that almost no signs of such progress
are to be seen in major reviews of the empirical literature up to the
early 1960s dealing with learning in the mentally deficient (Denny, 1964;
Lipman, 1963; Stevenson, 1963). Later sections of the present review will
deal with recent, somewhat more promising, trends appearing in research
programs involving more theoretical direction.

2. Comparative Experimental Studies

[...] reversal of this trend when aversive stimuli have been used. Luria (1963)
reports approximately equal rates of conditioning for “child oligophren- 
ics” and normal peers when a verbal stimulus was used in the role of a 
US. 

In extinction, on the contrary, though the data are by no means uni- 
form, the preponderance of studies show faster extinction in normals 
than in retarded subjects of similar CA. Primarily on the basis of this 
frequently observed extinction differential, it has been suggested (e.g., 
Denny, 1964) that the retarded learner is characterized by a pervasive 
inhibition deficit. This notion is intuitively appealing in some respects. 
Though without much specific documentation, there is a general tend- 
ency to associate mental retardation with inferior cortical functioning, 
and there is considerable reason to believe that an important function 
of the cortex in conditioning must be the selective inhibition of re- 
sponses and response components other than the precise CR un- 
dergoing acquisition. Also, it is easy to imagine extensions of the no- 
tion of inhibition deficit to aid in accounting for inferior performance 
by retardates in such situations as delayed reaction and discrimination 
learning. Further, an inhibition deficit on the part of retardates might 
account for the findings of Ross, Koski, and Yager (1964) to the effect 
that, within a retardate group, acquisition curves for eyelid
conditioning under 100% and 50% reinforcement schedules were virtually
identical and in sharp contrast to the direct relation between per- 
centage of reinforcement and rate of acquisition for college students. 

Nonetheless, it must be pointed out that the concept of inhibition 
deficit is at present little more than a speculation with some slight 
empirical support. A difference in extinction rates need not reflect a 
difference in an inhibition process in any technical sense of the term 
inhibition. To point out just one quite different process which may be 
involved in these results to an important extent, a considerable body of 
evidence indicates that a CR may become associated during training 
with contextual stimuli, including traces of stimuli occurring on pre- 
vious trials, as well as with the CS itself. When either human or animal 
subjects are given repeated sessions of conditioning and extinction it is 
generally observed that the rate of extinction becomes progressively 
more rapid. The interpretation usually offered is that the stimulus con- 
text changes greatly from acquisition to extinction, since during acqui- 
sition CRs are evoked in the presence of contextual cues from previous 
trials, all of which have terminated with USs, whereas during extinction
the CS is presented on any trial in a sequence of previous trials
which have not ended in USs. Differe[...] to) these contextual cues
could be responsible for the difference in extinction rate between
normal and retarded groups that has been attributed to an inhibition
deficit. The same factor could account for the [...]

[...] understanding of instructions, verbal expectations, and
responsiveness to contextual cues associated with preceding trials of an
experimental series.


Operant Conditioning. Although some learning theorists as- 
sume that operant conditioning is basically similar to, or reducible to, 
classical conditioning (Asratyan, 1967; Estes, 1959; Miller, 1959), the 
two learning situations are clearly distinguishable by the temporal rela- 
tionships and the contingencies between the experimenter-controlled 
variables. Whereas in classical conditioning, training is given in dis- 
crete trials, each beginning with the CS and terminating with the US, 
in operant-conditioning experiments the organism is ordinarily per- 
mitted to respond at its own rate with reward contingent upon speci- 
fied responses. Many studies of the retarded have been conducted 
using operant-conditioning methods, but the objectives have been char- 
acteristically quite different from those of classical-conditioning experi- 
ments. Whereas almost without exception the latter have been con- 
cerned with comparing normal and retarded individuals, the former 
have nearly always been attempts to see how far the behavior of
relatively severely retarded individuals can be brought under the control
of the same variables which are known to control operant conditioning
and discrimination learning in normal human beings and animals. On
the whole, a review of this literature gives the impression that even
severely retarded individuals respond in qualitatively much the same
way to various reward schedules and contingencies as do animals and
normal human beings (see Denny, 1964, and Krasner & Ullmann, 1965,
for summaries). However, special procedures may be needed to establish
effective reinforcers for the severely retarded (Orlando, 1965).
Detailed information as to how rates of acquisition and extinction of
simple operant responses might be related to intellectual level seems
to be entirely lacking.

Classical and operant-conditioning methods have offered special
advantages for the investigations of learning in the severely retarded,
with whom communication by means of instructions is difficult or
impossible. The most recent review of this area, by Spradlin and
Girardeau (1966), indicates that basic data upon responsiveness of the
severely retarded to standard USs and CSs are still sparse, with no
clearly established systematic differences between retarded and
normal subjects. Operant-conditioning methods have been effective in
training simple responses in even the severely retarded using food,
tokens, and some forms of social reinforcement. Though the data are
by no means as orderly as those frequently obtained with animals,
effects of different partial reinforcement schedules and discriminative
contingencies appear generally similar to those [...] copious literature
on operant conditioning of rats, [...] normal children (Bijou, 1957,
1958, 1961; Bi[...] Barnett, & Pryer, 1960; Johnson, [...] 1965;
Orlando & Bijou, 1960, 1967).

Operant conditioning techniques are directly translatable into [...]

[...] exchanged for food and the like. Then training [...] developed to
shape the children’s behavior toward the establishment of such habits as
caring for pets, making beds, group play, and simple work routines.
Formal data seem not yet to have been reported, but the investigators
report that “gross observations indicate that the desired behavior is
increasing in frequency” (Spradlin & Girardeau, 1966, p. 288).

[...] example, that retarded groups may do significantly better than equal
MA normals on simple sensorimotor tasks, even though the same 
groups yield an advantage for the normals in rote verbal learning 
(Cruickshank & Blake, 1957). 

The principal overall trends discernible in the literature (Denny, 
1964; Lipman, 1963) are that on very simple tasks retarded individuals 
perform more poorly than normals of equal CA initially but tend to 
catch up with practice; on more difficult tasks the pattern is similar, 
the retarded gaining somewhat on the normals but normal groups ap- 
pearing to approach higher terminal levels. 

On the much-studied pursuit-rotor task, performance clearly is di- 
rectly related to MA (Ellis & Sloan, 1957). Normal groups exhibit more 
rapid acquisition of skill within a single learning session and also
exhibit larger increases in performance (“reminiscence”) over rest
intervals (Ellis, Pryer, & Barnett, 1960; Jones & Ellis, 1962).

One might be tempted to take the latter finding as a reflection of the 
same process underlying the presumed “inhibition deficit” in classical 
conditioning, but the analogy seems most tenuous. With relatively 
massed training, performance on simple motor tasks such as rotary 
pursuit typically levels off considerably below maximum attainable 
performance, and, in fact, it is not unusual to observe a decrease in 
performance late in the session. These effects have usually been attrib- 
uted by theorists in the area of motor learning to the buildup of some 
kind of inhibition during the session. In terms of this concept, the
observed difference might reflect differences in the amount of inhibition
developed during the session or in rate of decay, but whereas the
assumption of a faster rate of decay of inhibition for normals might seem
to account in a simple way for the pursuit data, it would not be in
harmony with the interpretation of extinction differences in terms of an
inhibition deficit, for faster decay of inhibition on the part of normals 
would lead to more spontaneous recovery and therefore a slower, 
rather than more rapid, observed extinction rate. 

In any event, before this apparent paradox could be taken very seri- 
ously, one would need to rule out experimentally such alternative inter- 
pretations as differences in covert rehearsal between experimental sit- 
uations or differences in tendencies to develop and retain verbalizable 
tactics or strategies that facilitate performance. 

In the learning of finger mazes or stylus mazes, which generally 
demand little skill in motor performance, earlier studies utilizing open 
mazes generally exhibited no appreciable difference in learning rates
between retarded and normal subjects. When, however, the maze was
shielded from view of the subject, a direct relationship between speed
of acquisition and IQ emerged (Ellis, Pryer, Distefano, & Pryer, 1959).
Since the learning of a shielded maze is clearly facilitated by
verbalization of a sequence of turns, this latter finding may well be
simply another example of the often-observed tendency for the retarded to
be much less likely than normal individuals to develop and utilize verbal
rules and strategies.

[...]tive correlations have generally been observed between rates of
discrimination learning and both MA and IQ (e.g., House & Zeaman,
1960a), neither substantial nor consistent differences [...] obtained
from comparisons between [...]

[...] Zeaman and House (1967) noted that most experiments which failed to yield
advantages in rate of discrimination for normal children over equal MA 
retardates had utilized particularly easy or particularly difficult tasks. 
They conclude that with tasks of intermediate difficulty and with a 
wide range of IQs included in the sample, “at least a low positive cor- 
relation exists between IQ (with MA controlled) and performance in 
visual discrimination tasks [p. 198].”

In the search for the basis of this correlation, little positive evidence 
has been forthcoming for hypotheses attributing it to either the per- 
formance or the associative aspects of discrimination learning as these 
are conceived in classical learning theories. While MA may well be 
correlated with motivational determiners of performance, there is little 
support for a motivational interpretation of differences in
discrimination learning, since variation in incentives has been notably
ineffective. Neither is there support for any substantial role of a
general inhibition deficit. A careful examination of this possibility
yielded little evidence that nonreward of responses to the negative
stimulus of a discrimination problem is relatively less important as a
determiner of performance for retardates than for normals. On the basis
of their analyses of backward learning curves, and some experiments
conducted under conditions promoting attention to the relevant dimensions
of the discrimination problems, Zeaman and House (1967) have arrived at
the hypothesis that even retardates, once they attend to the relevant
stimulus dimension, learn visual discriminations very rapidly, perhaps
often in a single trial. Some support for this idea comes from a study
by Denny (1964, p. 112) who found that differences in discrimination
learning between retardates and normals of similar MA disappeared
when special attention was given to the handling of the subjects by
individual experimenters with a view to assuring attention on the part of
the subjects to the relevant cues of the discrimination problem.
Zeaman, House, and Orlando (1958) found that discrimination learning by
retardates was considerably facilitated by previous training in
associating verbal labels with the relevant cues (colors).

Comparisons of different IQ groups with respect to discrimination 
reversal have yielded generally inconsistent results and only the barest 
start has been made toward elucidating the basis for the differences. 
House and Zeaman (1962) found little difference between retardates 
and normals with respect to the advantage of a simple reversal over a 
nonreversal shift, the former proceeding more rapidly for subjects of 
all IQ levels. A followup study by Sanders, Ross, and Heal (1965)
replicated most of the conditions of the House and Zeaman study but found
only a slight advantage for reversal over [...] and essentially none for
retardates of similar MA. However, the two studies differed considerably
with respect [...] training on the original discrimination. A later study
by E. S. Ohlick (reported by Ross, 1966), utilizing only retarded
subjects, demonstrated a large advantage for reversal over nonreversal
shift when overtraining had been given on the original discrimination
[...] there had been no overtraining.

Tests for trans[...] following discrimin[...]

[...]ment. However, with a slightly simplified situation, these
investigators obtained evidence of progressive improvement even in these
very low IQ subjects (House & Zeaman, 1959). Kauf[...] found that
retardates with IQs of 50-75, although somewhat inferior to
normals, made substantial progress over a sequence of 95 object-dis- 
crimination problems. More detailed analysis showed the difference in 
performance between normal and retarded groups to be associated 
primarily with a greater incidence of perseverative errors on the part of 
the retardates. A number of other studies (reviewed by Denny, 1964), 
utilizing groups of retardates at various levels, have found substantial 
progress in learning-set formation for retardates, with the rate of
progress being directly related to IQ when CA was held constant.

The fact that in nearly all cases retarded subjects make substantial 
progress in the development of learning sets even over relatively short 
series of problems would seem to be of far greater import than their 
generally lower rates of progress as compared to equal-CA normals. 
The formation of learning sets on discrimination problems is simply a
special case of the far more ubiquitous phenomenon of “learning to
learn” which has been demonstrated in a number of other situations
for normal subjects. It is not obvious why it should be a matter of great
priority to continue demonstrating normal-versus-retardate differences
in rates of learning to learn. But it would seem to be of the greatest
moment to analyze more deeply the factors responsible for improvements
in learning rates and to determine by much more prolonged and
sustained investigations in a variety of situations the extent to which
the relatively lower learning rates generally characteristic of the
retarded may be raised by various types of experience.

Verbal Learning. Differences between normal and retarded indi- 
viduals in performance on verbal tasks seem scarcely to need docu- 
mentation in view of the considerable overlap in materials and tasks 
between intelligence tests and experimental verbal learning situations. 
As with normals, the most extensive studies of simple verbal learning 
in retardates have been conducted in paired-associate situations. Over
a variety of materials, some advantages have usually been observed for
normals over retardates of equal CA, the difference tending to increase
over successive lists (Lipman, 1963). When the paired-associate items
are not explicitly verbal in character, for example pairs of pictures, and
especially when there is no time pressure, retardates with IQs as low
as 50 exhibit acquisition rates almost equal to those of normals of
similar CA (Denny, 1964; Eisman, 1958). In general, with verbal materials,
substantial correlations are observed between IQ and errors to learn
within the population of retarded individuals (holding CA constant).
When normals are matched with retarded individuals on CA, the latter
show a decrement in acquisition rate, the amount of decrement
increasing with difficulty of the task. However, these differences tend to
disappear when MA is equated, suggesting an important role of previous
learning experience.

In serial verbal learning, even higher correlations between MA and
performance have been observed but, again, little difference remains
between normals and retarded groups when MA is equated. For groups
of equal CA, a difference in pattern of errors emerges, normals
exhibiting relatively higher frequencies of errors in the middle positions
of a serial list, and, conversely, a greater advantage in the end positions
(Lipman, 1963). This last finding fits in with evidence from quite
different tasks (which will be reviewed in a later section) tending to
suggest that retarded individuals are especially deficient in the use of
rehearsal and similar strategies for the facilitation of recall.

Although there have been a fair number of relevant studies, it is very
difficult to determine whether there is any relationship between IQ
level and long-term retention of material that has once been learned. In
reviews of the literature, Denny (1964) and Belmont (1967) both arrived
[...] conclusions, but emphasized the difficulties of
interpreting extant studies because of the very great difficulty of
controlling simultaneously for amount of training and level of performance
prior to the retention interval. Although further analysis with improved
experimental paradigms would be desirable, it seems fairly clear that
moderately retarded individuals exhibit no conspicuous long-term
retention deficits as compared to normals of similar CA.

[...] Spitz, & Lipman, 1959; Hamilton, 1966; Milgram, 1967; Stephens,
1966). On the basis of his studies of learning ability in children of
different IQ levels, Jensen (1963) has concluded that,

The habit of making verbal responses, either overtly or covertly, to events in the
environment seems to be one of the major ingredients of the kind of intelligence that
shows itself in school achievement and in performance on intelligence tests. Without
this habit, even a child with a perfectly normal nervous system in terms of
fundamental learning ability, will appear to be retarded, and indeed is retarded so long as
he does not use verbal mediators in learning [p. 138].


Wallace and Underwood (1964) found that fewer implicit associative 
responses to verbal stimuli were given by mentally retarded subjects 
than by normals. In the normals such responses tend to interfere with 
paired-associate learning but aid free recall. In the retarded subjects 
the frequency of these associative responses seems too low to exert 
either effect to an appreciable extent. In general, it is quite consist- 
ently observed that mentally retarded individuals are especially defi- 
cient at retaining or transferring the use of verbal mediators even when 
specifically trained in a particular situation. 

In the area of problem solving there seem to be almost no interpret- 
able data available. One of the few relevant experimental studies, that 
of Ellis and Sloan (1959), yielded a direct relationship between rate of 
solving form-oddity problems and MA and also a positive relation be- 
tween rate of solution and IQ when CA was held constant. 

Turnure and Zigler (1964) found that mentally retarded subjects 
were more likely than normals to use other people’s behavior as a 
guide to problem-solving efforts. Both retarded and normal subjects 
were found to become more imitative after induced failure experi- 
ences. In a later study by Achenbach and Zigler (1968) retarded sub- 
jects learned a relative-size discrimination as fast as normals under 
standard conditions. But if an obvious but inefficient cue were added (a
light which appeared over the correct stimulus after a delay) the
retarded individuals continued to use this cue much longer than normals.
Achenbach and Zigler referred to this tendency as a cue-learning
strategy and hypothesized that retarded individuals tend to depend on
such strategies in problem situations rather than searching for intrinsic
relationships. They found evidence that this tendency could be
overcome by specific training. On the whole, scarcely a beginning has been
made toward separating the roles of differential previous experience
and possible differences in capacity variables in this area.


Summary. From the rather large volume of work dealing with
comparisons between normal and retarded groups in a variety of
learning situations, only a few very tentative conclusions can be drawn.
It does seem clear that none of the learning or retention processes that
have been analyzed in normal human learners [...] qualitatively in the
mentally defective, [...] profoundly retarded level (below 10[...]

VI. Modifying Rates of Learning and Forgetting


The fruits of studies of mental development, including retardation 
and acceleration, should not simply be a collection of descriptions of 
individual differences, but should also provide insight into ways of pro- 
ducing desired changes in rates of learning or in retention. One who 
surveys the empirical literature on learning in relation to mental devel- 
opment (e.g., Denny, 1964) or, for that matter, the standard literature 
on laboratory studies of learning (e.g., Hall, 1966; McGeoch & Irion,
1952) cannot but be impressed at the vast quantity of work that has 
gone into demonstrating effects or differences which achieve statistical 
significance but are too small in magnitude to be of much practical 
import. Consequently, in this section I shall attempt to abstract from 
the literature bearing on each of the principal experimental paradigms 
reviewed above, and on selected educational settings, the studies
demonstrating variables or procedures which produce large effects upon
learning rates.

In this context, it is difficult to define “large” in any precise way,
but, roughly speaking, I shall limit attention to variables whose effects 
are so substantial and reproducible that variations in them might be 
expected to produce readily detectable effects upon the learning rates 
of virtually all individual subjects. It will be convenient to follow much 
the same outline as in our previous review of empirical studies, in each
case pointing out the implications of my general theoretical position
and summarizing relevant extant results.


1. classical and avoidance conditioning


In the theoretical view developed above we assume that associations 
form whenever the organism is effectively stimulated by any two stimulus
patterns which occur in succession and that the process is relatively
independent of other circumstances except for variables which
disturb the consolidation process. Thus variation in
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Since response evocation is assumed 
obtain, however, it is necessary not only that the organism experience
the two stimuli in succession at some time but also that the painful 
stimulus occur more often in the presence of the signal than in its 
absence. Although it seems almost unaccountable in retrospect, this 
extremely important relationship was largely overlooked in the con- 
ditioning literature until the appearance of a recent series of studies 
by Rescorla (1966, 1968). These important experiments demonstrat- 
ed clearly that for a given number of pairings of a CS and US, say 
a signal and a shock, probability of the occurrence of the CR upon 
subsequent tests could be varied over an extremely wide range by 
varying the frequency with which the shock occurred during intervals 
of time in the situation when the CS was not present. In a special case 
when the probability of a shock during an interval of time was the 
same whether or not the CS occurred during the interval, the CS was 
found to have acquired no control whatever over responding. 
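
The relationship that Rescorla manipulated can be put in numerical form. The following sketch is offered only as an illustration and is not Rescorla's own analysis. It assumes that a session is divided into equal intervals of time, each scored for whether the CS was present and whether a shock occurred, and it summarizes the contingency as the difference between the two conditional probabilities. The function name and all of the counts are hypothetical.

```python
def contingency(us_with_cs, cs_intervals, us_without_cs, no_cs_intervals):
    """Return P(US | CS) - P(US | no CS), computed from interval counts.

    A value near zero means the shock is no more likely in the presence
    of the signal than in its absence.
    """
    if cs_intervals <= 0 or no_cs_intervals <= 0:
        raise ValueError("each condition needs at least one interval")
    p_us_given_cs = us_with_cs / cs_intervals
    p_us_given_no_cs = us_without_cs / no_cs_intervals
    return p_us_given_cs - p_us_given_no_cs


# Two hypothetical schedules with the same number of CS-US pairings (20).
signalled_only = contingency(20, 50, 0, 200)   # no shocks in the absence of the CS
uncorrelated = contingency(20, 50, 80, 200)    # shock equally likely either way
print(signalled_only, uncorrelated)
```

Under these assumed counts the number of pairings is identical in the two schedules. Only the frequency of shock in the absence of the CS differs, which is the variable singled out in the experiments described above.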

Various findings which have seemed otherwise unrelated and which 
have been accounted for by various independent, ad hoc hypotheses, 
may be interpreted in terms of this major principle. For example, it is 
well known that if a stimulus to be used as a CS has been presented a 
number of times alone at the beginning of an experiment, the rate of
subsequent conditioning during a series of CS-US presentations is
retarded (e.g., Harris, 1941). It is apparent that if the CS is paired with
the US from its first appearance in the situation, the organism can very
quickly learn that the probability of a US is greater in the presence of
the CS than in its absence, whereas if the CS has appeared a number
of times alone, a number of pairings of CS and US must necessarily be
required before the organism can learn that the US now has a higher
probability in the presence of the CS.
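
The argument of the preceding paragraph can be made concrete with a little arithmetic. The sketch below rests on illustrative assumptions and reports nothing found by Harris or Rescorla. It supposes that the organism's estimate of the probability of the US in the presence of the CS is simply the relative frequency of pairings over all CS presentations so far, and that conditioned responding appears once that estimate reaches some criterion. Both function names and the criterion value are hypothetical.

```python
def estimated_p_us_given_cs(preexposures, pairings):
    """Relative frequency of the US over all CS presentations so far."""
    total = preexposures + pairings
    if total == 0:
        raise ValueError("no CS presentations yet")
    return pairings / total


def pairings_needed(preexposures, criterion):
    """Smallest number of pairings whose running estimate reaches criterion."""
    if not 0 < criterion < 1:
        raise ValueError("criterion must lie strictly between 0 and 1")
    n = 1
    while estimated_p_us_given_cs(preexposures, n) < criterion:
        n += 1
    return n


# Compare a CS paired from its first appearance with one shown alone 10 times.
for k in (0, 10):
    print(k, pairings_needed(k, criterion=0.5))
```

On these assumptions the number of pairings required grows with the number of presentations of the CS alone, which is the direction of the retardation effect described above.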

Another pertinent example may be the “blocking” of conditioning
demonstrated in an extremely provocative series of experiments by
Kamin (1965). In his basic experimental paradigm, Kamin trained rats
to anticipate shock by presenting a stimulus, S1, followed by shock a
number of times in succession (a standard training procedure for the
CER). Then he added a second stimulus, S2, presenting S1 plus S2
followed by shock on a number of trials. When, following these two
phases of training, tests were given with S2 alone, the animal's
behavior revealed no evidence of conditioning to S2. Kamin suggested
that prior conditioning of S1 had resulted in blocking the conditioning
of the “redundant” stimulus S2 when it was later added to the CS
compound. Another way to look at the same Kamin study is to note
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that during the first phase the animal established a certain tender 18 

a ol Sz 
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Kamin’s later experiments), 
the optimum interval for conditioning is very narrowly limited in some
situations, as for example eyelid conditioning, whereas in others rates
of conditioning are relatively independent of CS-US interval over a 
range of minutes or even hours. 

In order to make sense of the variegated results, we should note first 
of all that conditioning has been observed at very long intervals (in 
some cases an hour or more) only when both CS and US are of such a
character that stimuli having appreciable communality with either
would be unlikely to occur during the interval. Adequately documented
cases are those in which both CS and US are gustatory or in which the
CS is gustatory and the US is exposure to x rays, inducing
gastrointestinal disturbances and nausea (Garcia & Koelling, 1966). In
the situation where the index of conditioning is not a narrowly defined
response
but rather a persisting postural adjustment or change in level of 
ongoing activity [for instance, CER, passive avoidance conditioning, or 
“conditioned anticipation” of food (Estes, 1948)], the function relating 
rate of conditioning to the CS-US interval is relatively shallow, and 
effective conditioning can be obtained with intervals several minutes in 
duration. When the response has a clear preparatory function, that is,
a positive or negative effect upon the efficacy of the US (as in the case
of conditioned salivation with food or acid as the US), effective
conditioning is obtained with intervals of the order of 30 sec to 1 min
(Pavlov, 1927). When the CR is a sharply defined phasic reaction, like the
eyeblink, and particularly when the occurrence of this response will affect
reception of the US only if it occurs during some brief interval of time, 
the function relating rate of conditioning to CS-US interval is steep, 
with the optimum value falling in a narrowly defined interval, usually 
of the order of .25-.50 sec. 

Relations between the CS and US other than the interval between 
them are also important for the conditioning process. It has frequently 
been observed, and would be expected on theoretical grounds, that 
conditioning is facilitated if the CS initially evokes a response similar 
to, or at least not antagonistic to, the UR. Some recent studies have 
shown further that relationships apparently involving simply the mo- 
dalities of the CS and US may be of major consequence. For example, 
Garcia, McGowen, Ervin, and Koelling (1968) showed that when rats 
were given flavored food pellets followed by administrations of x rays 
which produced illness, avoidance behavior on subsequent tasks was 
controlled by the flavor of the food pellets but not by the size. Contrari- 
wise, when the pellets were given preceding administration of electric 
shock to the skin, subsequent tests revealed conditioned avoidance
behavior to be a function of the size of the pellets but not of the flavor.
These effects were very large in magnitude, the less effective CS
in each case yielding essentially no evidence of conditioning whereas
the more effective CS yielde 
very few trials. 

In avoidance conditioning, 
classical case, the 
US interact in e 
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Avoidance conditioning 
though the onset of the € 
shock is likely to follow, occurrence of the 
Sponse produces no clearly perceptible 


study by Bower, Starr, and Lazarovitz (1965) has shown that
conditioning is greatly facilitated if conditions are arranged so that the
occurrence of the avoidance response not only prevents the shock but
also produces a clearly perceptible new stimulus which is correlated
with absence of shock (and thus becomes gs 


ganism). Similarly, larg 
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verbally induced attitudes or sets. Aspects of the experimental situa- 
tion, most especially specific instructions, which lead the subject vol- 
untarily to facilitate or inhibit the CR may produce increases or de- 
creases in CR frequency by as much as a factor of two or three as 
compared to levels produced by training under more neutral conditions 
(Hilgard, 1938; Kimble, 1961; Miller, 1939). Similarly, manipulations of
the experimental situation and instructions which lead the subject to 
anticipate discontinuation of the US may produce apparent extinction 
of the CR prior to the first nonreinforced trial (Cook & Harris, 1937). 


2. operant conditioning 

In view of the well-documented efficacy of operant conditioning 
methods for establishing control of behavior, it might seem that the
operant literature should be rich with examples of procedural
manipulations which produce major differences in rates of conditioning.
However, it has been a tradition in this area for the acquisition process to
be taken more or less for granted. Investigators learn in the laboratory
the suitable procedures for producing effective conditioning in a given 
situation, and once having brought behavior under control by suitable 
reinforcement contingencies proceed to study the effects of schedules 
of reinforcement, drugs, or other conditions. It is difficult to find
explicit controlled comparisons of the effects of different procedures
upon acquisition rates. 

On the basis of general experience, it appears that optimal condi- 
tions for acquisition are essentially those which one would expect on 
the basis of theoretical analysis. First of all, it is always essential to 
prepare for conditioning by insuring that the organism accepts the 
reward which is to be used, and that some clear signal of the
administration of reward is recognized by the learner. Conditioning of so
simple a response as bar pressing in the rat or key pecking in the
pigeon can be tedious if the animal has not first been trained to accept
food readily in the situation or if the dispenser does not produce some
clearly perceptible change in stimulus conditions whenever food has
been made available.

Even with human subjects, attention needs to be given to the
appropriateness of the reward for the particular type of learner and
situation. For example, Locke (1968a, b) found this variable to be of critical
importance in obtaining verbal conditioning with moderately retarded
individuals. Although previous attempts, with “good” used as a
reinforcer for verbal responses, had been unsuccessful, substantial effects
were obtained with tokens [...] reinforcers.
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the same verbal utterance may have quite dif- 
iveness as a reinforcer for a given individual 
aracteristics and behavior of the person doing 


mporal succession by 
the stimulus consequences of re- 
s signifying the onset of reward. 

ntally deficient, verbal instructions 
nstrations can be effectively utilized to further this program 
(Ramsey, 1963). 

Some major re 
limits of discrimination learning have turned out to be relatively
independent of either motivation or amount of reward, provided only that
the motivating and reinforcing conditions are adequate to maintain the
behavior involved in training.

The precise manner in which the organism is brought to the point of 
making the necessary responses in the training situation may be of 
considerable import. Frequently, particularly in studies with animals, 
young children, or the mentally retarded, pretraining consists in first 
rewarding the learner for responding to both positive and negative 
stimuli; only after the learner responds readily to all stimuli that will
be involved in the task are differential contingencies arranged so that 
only the correct response to each stimulus is rewarded. A result of this 
kind of pretraining is that mastery of a discrimination must entail un- 
learning of habits of responding which were rewarded during pre- 
training but which were classified as errors after discriminative contin- 
gencies were instituted. 

An alternative to the tedious experimental extinction of nonadaptive 
responses is the temporary breakup of old habits, and consequent in- 
stigation of response variability, by an abrupt change in conditions. For 
example, Zeaman, House, and Orlando (1958) reported that after sev- 
eral retarded children had failed to progress on a color-form discrimi- 
nation for over 200 trials, the replacement of the original positive or the 
original negative stimulus with a novel one led to sudden learning of
the new problem.

By contrast, under a procedure first explored by Skinner (1938) and 
later developed systematically by Terrace (1963a, b), the animal is 
trained in such a way that it is rewarded for making the correct
response to the positive stimulus from the outset and never develops
strong incorrect responses which have to be unlearned. For example,
in training a pigeon to peck a red key and withhold response from a
green key, Terrace rewarded the bird for pecking at the red key until
the behavior was well established, then during intervals between trials
on the red key he gradually faded in the green color in such a way that
the animal, which initially had no tendency to peck a darkened key,
did not begin responding to the key during unrewarded intervals. By
suitably programming the training, it was possible to introduce the
negative stimulus, in the example the key illuminated with green as
opposed to red light, sufficiently gradually so that the animal did not
begin responding to it at any step. Ultimately the animal was observed




to peck the red key and ignore the gre 
experienced nonreward for responding 
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ce of the frustration or other emotional 
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g others, Terrace (963a, b) working with 
orking with rats, and House and Zeaman 
dates, found that learning of a relatively 
d be greatly facilitated if the subjects were 
discrimination involving the same stimulus 
Early studies by Muenzinger and Fletcher (1937), among others,
showed that the speed of discrimination learning in the rat could be
substantially increased over usual levels by such measures as the
introduction of a glass barrier or a gap in the pathway of the
discrimination apparatus just prior to the choice point, which forced the
animal to [...] thus increased the likelihood of its perceiving the
relevant cues at the choice point before making a choice response.
Following a quite different approach, Lawrence (1949, 1950) showed that
once animals had learned the relation between a stimulus property and
reward while acquiring one discrimination, they exhibited an accelerated
learning rate, as compared to controls, on a new discrimination in- 
volving the same positive cue, even though the response involved was 
quite different. Bower and Grusec (1964) showed that much the same 
result could be produced if the animals were simply given prior 
training in which the positive cue was paired with reward presenta- 
tions and the negative cue with absence of reward; a similar phenom- 
enon has been demonstrated with children by Witryol, Lowden, and 
Fagan (1967). 

Conversely, if the individual for some reason has learned not to at- 
tend to the relevant cues of the discrimination task, rate of discrimina- 
tion learning is substantially impaired. For example, Levine (1962, 
1963) showed that, with adult human subjects, even a very few trials of 
partial reinforcement training with respect to cues which were clearly 
discriminable by the subjects severely crippled the acquisition of a 
discrimination between these cues when subsequently they were dif- 
ferentially related to reward and nonreward. 

Comparing the learning of visual and tactual discriminations by 3-4- 
year-old children, Blank and Altman (1968) found both original acquisi- 
tion and reversal to be more rapid for the tactual problems. Analysis 
indicated that this superficially surprising result is attributable to pro- 
cedures which more effectively constrained the children to attend to 
the more subtle relevant cues in the tactual stimuli. In a second experi- 
ment, reversal of the visual discrimination was substantially facilitated 
by an imposed delay on each trial during which the children viewed 
both of the stimuli presented before making a choice, and was further 
speeded if the children were required to draw the stimuli they chose. 


Similarly, Caron (1968) found that acquisition and transfer of a
visual-form concept (angularity) in 3-year-old children were substantially
facilitated by procedures which led the children to respond actively to,
and therefore presumably attend to, critical features of the stimulus
figures.

In the case of mature human subjects, verbal instructions are the
most common means of directing attention toward the relevant cues in
r appropriate conditions, the aurs 
even with the severely retarded. An early 
bility (Zeaman, House, & Orlando, 1958) 
the imbecile range, rate of learning a gulor 
y related to ability to name the colors. More 
5) demonstrated facilitation of vis 1al-dis- 
part of both normal nursery-school children 


when specific instructions were given, calculated to direct the children’s
attention toward the relevant cues. This
result was extended by Milgram and Noce (1968), who showed further 
that verbalizatio dimension interfered with learning. 
Dickerson, Girardeau, and Spradlin (1964) obtained evidence that 


directed to dimensions (see pp. 170-171). 
Attention to relevant st Tes is not a sufficient condition 
n learning. The organism must experience in 
at cues and the differential re- 
electing the Positive or negative values on 
discrimination learning it ap- 
> ‘ ination even between such con- 
spicuously different stimuli a d white doors only with tedi- 


. à 3 running to hundreds and even thousands of 
trials. Lashley diagnosed the source of the difficulty as being the
remoteness of the discriminative cues from the rewarding consequences


5 de i š a 
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the stimulus pattern was dis 
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black and white but betwee s open and filled (dangles: 


Pes in training series 
Y, 1930). Further ; j 
i A 1Mprovements in the 
apparatus by Fields (1954) yielded results Showing a 1 could 
not only learn discriminations between complex 18041 . 
as pictures cut from magazines, relatively rapidly, but Dae 8 i 
and maintain a considerable number of such discriminations concurrently.

The importance of contiguity of cue, response, and rewarding or 
nonrewarding outcomes is not limited to the lower animals. In more 
recent studies Jarvik (1953) and Murphy and Miller (1955) have shown 
that primates are susceptible to variation in these factors,
discriminations which are learned with extreme rapidity when cue and reward
are in close spatial juxtaposition becoming difficult or impossible with
even small separations. Similar results have been demonstrated with
young children by Jeffrey and Cohen (1964).

Various means other than simple spatial or temporal contiguity can 
be used to aid the learner to perceive relationships between changes in 
relevant stimuli and changes in rewarding outcomes. For example, 
Bauer and Lawrence (1953) showed that discrimination learning by rats 
in a simple two-choice apparatus could be considerably facilitated if 
the colors (black or white) of the pathways corresponded to the colors 
of the positive and negative goal boxes. In similar vein, Shepp (1964) 
showed that learning of successive discriminations by retarded chil- 
dren could be substantially facilitated if distinctly different rewards 
were uniformly associated with the two positions in the modified
Wisconsin general test apparatus (WGTA) and similarly that simultaneous
discrimination learning could be modified over an extremely wide
range according as stimulus properties of the cues and rewards were
positively or negatively correlated.

Similarly striking modifications of discrimination-learning rates in 
the pigeon have been reported by Jenkins and Sainsbury (1969) in a 
recent series of studies involving the correlation of distinctive stimulus 
characteristics with either reward or nonreward. The situation involves 
two keys, one of which typically presents a small, distinctive stimulus 
such as a circle or square on a uniformly lighted background whereas 
the other key is uniformly lighted with no such feature present. When 
the distinctive feature is on the positive key, discrimination learning, 
as measured by relative frequency of choices of positive key, is ex- 
tremely rapid, but when the distinctive feature is on the negative key, 
acquisition is severely retarded with, in some cases, no departure from 
chance responding evident over a long series of trials. 

Detailed analyses of the particular loci on the keys at which the 
birds pecked revealed that rates of learning in the two cases were ac- 
tually about the same, the animal learning to peck the distinctive fea- 
ture when it was associated with reward and to avoid the distinctive 
need for forming additional associations. Both the size of the
individual’s repertoire of such associations and his specific skill or set toward
searching for these in new situations must be important determiners of 
performance during paired-associate learning. 

One of the most conspicuous generalizations arising from a review of 
the empirical literature on paired-associate learning is that the majority 
of the variables commonly studied, including distribution of practice, 
length of list, whole versus part presentation of list, and immediacy of 
knowledge of results, have very small effects upon rates of acquisition. 
By far the most important factor is the collection of variables sub- 
sumed under the notion of “meaningfulness” of the material being 
learned, and within this category, response meaningfulness has proved 
of considerably greater import than stimulus meaningfulness (Under- 
wood & Schulz, 1960). Meaningfulness involves, of course, a relation- 
ship between the material and the individual doing the learning and 
consequently must depend heavily upon previous experience. Conse- 
quently observed rate of acquisition of paired associates can be taken 
to be a measure of learning ability only if aspects of the learners’ past 
histories pertaining to meaningfulness of the material are adequately 
controlled. Conversely, learning of a particular type of material can be 
substantially facilitated if suitable measures are taken prior to training 
to increase the meaningfulness of the material for the subjects. 

As in the case of discrimination learning, paired-associate learning
can be modified considerably by measures which facilitate or interfere
with stimulus differentiation. Sprague and Binder (1966) found that
both mildly and moderately retarded children acquired paired
associates much faster when the response members of items were oral than
when they were push-button operations; evidently in the latter case,
the motor response interfered with the subject's orientation toward the
stimulus. Using a somewhat similar paired-associate task in which
pictures were associated with pushbuttons, Smith, Means, and Fishkin
(1968) showed that acquisition and transfer by retarded subjects (IQ
35-80) could be substantially improved if the subjects were forced to
name the pictures and corresponding buttons.

For reasons that are by no means fully understood, the use of verbal
labels and other verbal mediators is frequently found greatly to
accelerate the acquisition of paired associates in subjects at all
developmental levels. Quite possibly one function of the use of such mediators
is to direct the learner’s attention toward differentiating features of the
items to be learned. Another may be to increase the number of
associations available to support recall of a particular stimulus-response
relationship.

A series of studies by Bryant (1965a, b) has shown that verbal
labeling of pictures increases recall on the part of both normal and
mentally retarded children. Recognition however is increased for
normals only. A series of studies by Jensen and his associates has
examined the role of other forms of verbal mediators in paired-associate
learning by both normal and retarded adults (Jensen, 1965; Jensen &
Rohwer, 1963a, b). Acquisition of paired associates by retarded adults
was greatly facilitated when the subjects were instructed to make up
sentences using both members of the paired-associate item. When the
experimenter supplied mediators, acquisition was substantially
facilitated but the advantage was lost on retesting 10 days later. Similarly
Milgram (1967) found facilitation of pair 


PADES h 
ed-associate learning in bot 
bjects by mediat 


pe of research that men- 
ously use mediators, but there 

er they could be led to do so by 
appropriate training period of time. 


5. educational situations 


[...] impression that this expectation is very common, and, further,
that it is responsible for the negative attitude toward learning theory
which develops in many educators upon observing that what they find in
this literature is almost exclusively a collection of strenuous attempts
to demonstrate statistical significance of very small differences in
learning rate.

I would like to suggest that the expectations are based on obsolete
conceptions of learning theory and thus that the disappointments are
unjustified. From the standpoint of modern learning theory, as outlined
in the preceding chapters, ability to learn any particular type of task,
and in particular the relatively complex ones involved in school
learning, depends always to a major extent upon a combination of skills,
concepts, and learning strategies, which are in turn a function of an
individual’s previous experience. Therefore there is no reason to ex- 
pect that learning theory, as it evolves toward a higher level of scien- 
tific adequacy and rigor, should be increasingly able to dictate methods 
which will produce major changes in rates or limits of learning when 
applied across the board to groups of individuals with differing learning 
histories. 

We should expect direct applications of learning theory to be of 
major effect largely as they are directed to the diagnosis and manage- 
ment of individual cases. Of necessity it must be difficult or impossible 
to assess the efficacy of learning theory at any particular stage of de- 
velopment by controlled experiments. It may be instructive, however, 
to cite a few particular examples illustrating what can now be accom- 
plished in dealing with cases of learning disability and in suggesting 
some potentialities for the exploitation of learning theory in this type of 
application. 

A second major point of contact between theory and practice lies in
the development of technology. Devices such as teaching machines
resulting from engineering advances do not in themselves influence
rates of learning but they do permit the trainer or educator to apply 
more effectively whatever principles, be they good or bad, he chooses 
to employ. 

Analyses of Individual Cases. Although an individual’s failure to
attain some criterion of performance is conventionally taken as
evidence that he lacks some necessary capacity, appropriate analysis
often uncovers some quite different source of the observed disability.
In dealing with the severely mentally retarded, for example, one must
be alert to the possibility that an individual’s apparent inability to
make a discrimination may well reflect the experimenter or tester's
failure to communicate with the subject. Some rather impressive
successes in dealing with this type of problem by application of operant
conditioning techniques will be discussed in Chapter XII, together
with other applications of operant principles to the training of simple
habits in individuals of extremely limited behavioral repertories.

It is quite likely that even at much higher educational levels, cases of
apparent learning or perceptual disability sometimes actually reflect
failures of communication. Because of the greater complexity of the
learning or testing situations, these cases will necessarily be less
obvious and may often be uncovered only by careful experimental analysis.
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ased by training whose 
he children to attend to 


ing-readiness test if they 
concept of similarity involved (in 
imilar form but different orienta- 

same). In their experimental study, 
Caldwell and Hall obtaine ntial support for this fap hana oil 
training with respect to the concep- 


to produce learning in an individual of 
the usual initial response of the trainer OF 
sing the incentives for successful perform- 


r amounts of practice. However, no increase 
unt of practice c 
zonstituent habits or 


apparently normal ability, 
teacher consists in increa 


8 k by means of 
earphones as he attuned his voice to the pitch of the note. Then the 


tone generator was turned off and the subject continued to make the 
sound independently. At the next step the subject reproduced the 
sound after an interval of up to a half-dozen seconds then proceeded to 




reproduce simple patterns of notes. Following this training the children 
were able to pass the tests which they had originally failed. Leontiev 
concludes, 


investigation of cases of inability to perform fully certain definite intellectual actions 
shows that this is only a function of an incorrectly formed functional system which
can be successfully reconstructed. To do this, as experiments have shown, it is neces- 
sary to bring to light the undeveloped or incorrectly developed links, and to take people 
back to the stage at which the links in question are formed in their elementary ex- 
ternal form [p. 232]. 


Another study illustrating the same motif involves 7-8-year-old chil- 
dren backward in learning arithmetic (Slavina, 1957). The particular 
children under investigation showed unusual “intellectual passivity,” 
which was attributed to unfortunate earlier training. To remedy this 
passivity, the children were induced to enter into games involving 
numerical calculations. This measure seemed to lead to more positive 
affect and “lively cognitive activity” on the part of the children but not 
to appreciably greater success in learning arithmetic. Detailed analysis 
of the behavior of some of the individual children revealed that al- 
though they knew the names of the digits and could apparently count, 
they failed to grasp the conception of one number succeeding another 
or of the principle that any number and its successor differ by one unit. 
The children then were given special training in the elements of count-
ing, manipulations with rows of sticks being coordinated with the use
of numbers. With the explicit use of habits of counting thus estab-
lished the children were able to learn addition and subtraction in the
experimental setting. It is further reported that following this training
they were able to learn multiplication in the regular classroom at an
apparently normal rate.


Technologies. The most widely known application of learning 
principles to the technology of education is the combination of appa- 
ratus and procedures which has become familiar under the rubric 
“teaching machines” (Lumsdaine & Glaser, 1960). As introduced by 
Pressey (1926), the teaching machine was simply a device which per- 
mitted a pupil to proceed through the memorization of a set of verbal 
material at his own pace, being exposed to a single item of information 
at a time and continually testing himself by means of multiple-choice
questions.

In Skinner's hands the rationale for utilizing the teaching machine
was broadened by the application of operant conditioning principles
and expanded into the conception of programmed instruction (1954,
ching machine over 
yzing material to be 
sponse units and ar- 
pupil would have to 
“response relationship before pro- 


the specific ways in whi 


would be expected to respond differently to various questions and 


has not. A second major 
ptions of errorless discrimination learning 
in a previous chapter, is to break 3 
fficiently small steps so that the pupil 
ence making very few errors and thus 
n of rewards for correct responding. As 
rant conditioning, errors are taken to be 
of the program rather than defects on the 
part of the learner. 

One important me 


training sequence down into su 


rrors is the “fading” technique, 


whereby the pupil is th the full set of constituents of 
a relationship which is to be learned; 


tee then on successive occasions 
elements are eliminated until 


Proceeding through the 
ly by reward if correct and im- 

The question of principal interest in the 
the use of teaching machines and the associated rationale have pro-
duced any unusually successful results in bring;

In some of the earlier reports, Ellson, Harris, and Barber (1968) and
Birnbrauer, Bijou, Wolf, and Kidder (1965) report some success in 
teaching vocabulary to retarded children who had presumably made 
little progress with usual methods. Price (1963) found no advantage for 
teaching machines in the instruction of retarded children in addition 
and subtraction. On the other hand, Blackman and Smith (1964) report 
that their programmed instructional sequence produced substantial 
increases in word recognition in institutionalized retardates. By the 
end of their sequence the subjects were able to read phrases and sen- 
tences and exhibited relatively high retention of these skills over a 3- 
month interval. Similar results were reported for the acquisition of 
simple arithmetic skills. Malpass, Hardy, Gilmore, and Williams (1964), 
also working with institutionalized retarded children, report that their 
programmed instruction sequence led to the acquisition of word recog- 
nition and simple reading and spelling skills at a rate appreciably supe- 
rior to classroom teaching. Again retention was impressively high, 
75-90% on tests of ability to read words, phrases, sentences, and para- 
graphs over a 3-month interval. Similar results were reported by 
Lawson and Watson (1964). 

Virtually no evidence is available in the form of experimentally con- 
trolled comparisons between programmed instruction and conventional 
teaching methods in the training of individuals with learning disabili-
ties. This is not to be taken as a criticism of investigators in this area,
however, for fully controlled comparisons are of necessity extremely 
difficult to achieve and, further, I can see no reason to think that ef- 
forts in this direction deserve high priority. Surely the first order of 
business must be to find techniques which produce substantial effects 
on performance, then by suitable manipulations and variations within 
the particular approach to achieve an understanding of the factors re- 
sponsible. The positive results reported to date doubtless have been 
achieved because of some and in spite of other features of the methods 
used. 

On general theoretical grounds I should be inclined to believe that 
the favorable results obtained with teaching machines may be attribut- 
able primarily to the attention to detailed analysis of any particular 
learning situation, and the use of errors as diagnostic information. On 
the other hand I have some reservations about the heavy emphasis on 
immediate rewards often associated with the use of teaching machines 
and other applications of operant conditioning methods. Motivation 


and reward are important, but for effective training they need be used
so that multiple relationships between actions and rewards do not
simply place a substantial additional burden on the learner's
information-processing capabilities.

Some of the principles of management of rewards that seem to me to
be indicated by modern learning theory are perhaps more effectively
embodied in a quite different bit of technology, the “autotelic
environment” devised by Moore (1966) for the purpose of teaching
linguistic skills to very young children. With Moore's “talking
typewriter” the child receives very prompt and clearly interpretable
feedback from his responses and conditions are optimized for the close
association of spoken sounds, for example letters of the alphabet, which
the child hears, and the corresponding visual forms which he produces
on the typewriter. The child is not distracted by the administration of
rewards following individual responses, but attention is given to
motivating him to enter the experimental situation and take part in the
tasks provided. I have seen no experimental studies of this technique,
but rather detailed accounts of individual cases reported by Moore
(1966) suggest considerable success at teaching reading to children at a
much younger age than that at which reading instruction is usually
attempted. It might be noted also that Moore makes considerable use of
the device of embedding the associative learning task in a larger
activity, for example, learning to read or to type letters in the activity
of producing a classroom newspaper.

The latter aspect of Moore's procedure could be expected to be
important for its motivational value but perhaps more important for the
possibilities of inducing much more extensive and more varied
repetition of learned responses than can be secured solely within
experimental or school learning situations.

It is well established that the availability of learned associations is
strongly influenced by the frequency with which they have been
activated (see, e.g., Allen, Mahler, & Estes, 1969). Even within intensive
training programs, however, the amount of repetition accruing to
associations established in experimental or school situations must be
much smaller than that commonly attained in everyday life situations
outside the laboratory. The substantial success claimed by recent
Russian writers for their programs of education of the mentally retarded
in their “auxiliary schools” is attributed to an important extent to the
use of “the method of modified repetition,” that is the instigation of
repetition and rehearsal in varied contexts, and the establishment of
connections between material learned in school and practical activities
carried on outside (Shif, 1969).

Vastly increased potentialities for providing practice in varied
contexts within an experimental situation, and more importantly for
adapting instructional programs to individual cases of learning
disability, are provided by the new techniques of computer-assisted
instruction which are just becoming available for experimentation on a
major scale. Also, with these techniques it is possible to collect and
analyze data from educational learning situations on a scale adequate to
open the way to rigorous tests of the implications of learning theories in
practical settings. One of the first substantial reports of a
computer-assisted educational program for elementary-school students
(Suppes, Jerman, and Brian, 1968) has provided evidence of an important
function of short-term memory in the solution of arithmetic problems by
children, thus opening up the possibility of bringing to bear upon the
teaching of arithmetic results of the extensive current experimental
and theoretical work on short-term memory.

The same advances in technology may, within the foreseeable future,
permit the emergence of a type of research which at present is
almost entirely lacking and in the absence of which only the most
limited progress can be expected toward theoretical interpretation of the
development of learning abilities. At present nearly all experimental
studies of learning involve very short time periods, tapping the abilities
of the individual only at one particular stage of development.
Developmental studies provide some picture of the learning and cognitive
abilities which characterize individuals at different ages which thus must
be presumed to develop as a result of the haphazard sequence of
experiences available within the normal range of environments. But
virtually no information is available to permit the comparison of the
effects of different specific sequences of experiences over appreciable
periods of time on the development of motives, general behavioral
strategies, or learning and cognitive abilities.


Part C
Applications of Learning Theories


VII. Hull's Theory 


The dominance of Hull's theory in American research on learning 
between 1940 and the late 1950s can scarcely be overestimated. One 
concrete, though informal, bit of documentation results from a survey 
of journal literature on learning conducted by members of my graduate 
seminar on learning theory at Indiana University in 1955. 
The students were instructed to classify each article on learning in
the 1954 issues of the major American journal sources (Journal of
Experimental Psychology, Journal of Comparative and Physiological
Psychology, Psychological Review, and American Journal of
Psychology) in terms of the theory, if any, which played a significant role
in the formulation of the research problem or the interpretation of the
results. Of 190 articles examined, 58% were judged to be related to
some learning theory to an important extent, and, of these, 46% were
formulated and interpreted in terms of Hull's system (including the
“neo-Hullian” theories of Spence and Miller). Even these figures do
not fully indicate the degree of influence of the system, for in many of
the articles judged to be primarily empirical in orientation, such
concepts as drive, reinforcement, stimulus trace, and inhibition were used
in much the same way as in those more closely affiliated with the
formal theoretical system.

Hull's system represented a major attempt at a synthesis of the
traditions which guided a great part of American research on learning
prior to 1940: associationism, conditioning, and law of effect theory.
As a preliminary to discussing the representation of individual
differences and the approach to problems of mental development within the
system, it will be useful to review briefly some of the major concepts
and assumptions. For this purpose the formulation given in Principles
of Behavior (1943) provides a convenient point of departure.


1. major concepts and postulates 

One of Hull's principal theoretical contributions is a definite and
workable conceptualization of the distinction between learning and
performance; more specifically, of the contributions of learning and
motivational conditions to the determination of behavior at any time.
The effects of prior learning are represented in the concept of habit,
including inhibition which may be viewed as a sort of negative habit.
Behavior at any time is taken to be a joint function of habit and drive,
according to the familiar formula

$$E = H \times D - I,$$


where E denotes excitatory tendency for some given bit of behavior, H
the habit involved (that is, the associative relation between stimulus
and response), D the drive condition, and I inhibition. (Throughout this
section some of Hull's subscript notation is suppressed for simplicity
of exposition.)

It is assumed that any given index of response tendency is some
function of E, that is,

$$R = f(E).$$


This is a shorthand representation of a family of specific functions,
each of which would express some response measure as some specific
function of excitatory tendency, e.g., P = W(E - L), where P denotes
response probability, W is the normal probability function, and L the
threshold of excitatory strength; T = ..., where T denotes response
latency; n = ..., where n denotes number of responses to extinction;
and A = ..., where A denotes response amplitude. Habit strength H is,
in turn, assumed to be a function of a number of independent variables:
t, the delay of reinforcement following response occurrence; t', the
interval between stimulus and response on practice trials; w, the
amount of need reduction occurring following a response occurrence;
and N, the number of training trials. The function relating habit
strength to each of these variables is assumed to be exponential in
form so that

$$H = M \exp t \, \exp t' \, (1 - \exp w)(1 - \exp N).$$

Thus, in the acquisition of a particular Stimulus-response associa- 
tion, say a single vocabulary item such as the relation between the 
printed word hat and the spoken word “hat,” the habit to be estab-
lished is that relating S_hat to R_hat. The value of H for this habit would
grow exponentially as a function of practice trials; I would probably be
negligible since little work is involved in verbal responses; and D 




would depend upon instructions and other relevant motivating condi- 
tions, for example, degree of anxiety. If the habit associated with the 
correct association were the dominant one at a particular time, then 
the corresponding excitatory tendency would be an increasing function 
of D, and performance of the correct response would be positively re- 
lated to motivation. However, the rate of growth of H itself would be 
independent of motivating conditions except insofar as these might 
control the performance of behaviors conducive to practice as opposed 
to some interfering activity. 
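
The separation of learning from performance in this example can be
sketched numerically. The following Python fragment is only an
illustration under stated assumptions: the form E = H x D - I is taken
from the text, but the particular growth function H = M(1 - exp(-rN))
and all parameter values (asymptote, rate, drive levels) are
hypothetical choices, not Hull's fitted constants.

```python
import math


def habit_strength(n_trials, asymptote=1.0, rate=0.1):
    """Habit strength H after n_trials practice trials.

    Assumed growth form: H = M * (1 - exp(-rate * N)).
    """
    return asymptote * (1.0 - math.exp(-rate * n_trials))


def excitatory_tendency(habit, drive, inhibition=0.0):
    """Excitatory tendency E = H x D - I, as in the formula in the text."""
    return habit * drive - inhibition


# H depends only on practice; D scales performance without changing H.
for n in (0, 5, 20, 80):
    h = habit_strength(n)
    e_low = excitatory_tendency(h, drive=0.5)
    e_high = excitatory_tendency(h, drive=2.0)
    print(f"N={n:3d}  H={h:.3f}  E(D=0.5)={e_low:.3f}  E(D=2.0)={e_high:.3f}")
```

In this sketch the same H value yields different E values at different
drive levels, which is the sense in which the growth of H is independent
of motivating conditions while performance is not.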

The principal mechanism assumed to underlie the transfer of 
learning to new situations is stimulus generalization. This notion is 
represented in the theory by the assumption that, when a habit has 
been acquired relating a particular stimulus to a given response, the 
habit strength is automatically generalized to other stimuli which are 
similar to the given one, the amount of generalization being a function 
of similarity. Thus, in the example of the vocabulary item, once the 
association between the printed word hat and the oral response is 
learned, generalization should be expected to occur to a very high de-
gree to stimulus displays in which the word hat is printed in slightly
different type, or in capitals as opposed to lower-case letters, and this 
generalization will permit the subject to make the correct response to 
this printed pattern in a variety of new situations. At the same time, 
generalization, though in much smaller degree, might occur to the 
printed word hot, in which case it would lead to an incorrect response. 
Thus, in order to build up a capability to respond appropriately, 
without errors either of omission or commission, initial acquisition 
must be followed by discrimination training in which the association 
between hat and “hat” continues to be reinforced while the association 
between hot and “hat” is unreinforced. The consequence of the dis-
crimination training would be that inhibition, I, would build up for the
incorrect habit, thus reducing the corresponding excitatory tendency; 
the values of E for the correct and incorrect habits would diverge until 
ultimately the response “hat” would be made only to the correct 
printed word and never to the incorrect one. 
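
The course of generalization followed by discrimination training can be
illustrated with a small Python sketch. The exponential generalization
gradient, the fixed increment of inhibition per unreinforced trial, and
all numerical values are assumptions introduced for illustration only;
the text specifies merely that generalization is a function of
similarity and that inhibition builds up for the incorrect habit.

```python
import math


def generalized_habit(habit, distance, slope=1.0):
    """Habit strength generalized to a stimulus at a given dissimilarity.

    Assumed gradient: exponential decline with distance.
    """
    return habit * math.exp(-slope * distance)


def discrimination_training(trials, habit=0.9, drive=1.0,
                            distance=1.0, inhibition_step=0.05):
    """Return (E_correct, E_incorrect) after unreinforced trials.

    Each unreinforced response to the similar stimulus (e.g., hot) adds a
    fixed amount of inhibition I to the incorrect habit, capped so that
    its excitatory tendency does not fall below zero.
    """
    h_incorrect = generalized_habit(habit, distance)
    inhibition = 0.0
    for _ in range(trials):
        inhibition = min(inhibition + inhibition_step, h_incorrect * drive)
    e_correct = habit * drive
    e_incorrect = h_incorrect * drive - inhibition
    return e_correct, e_incorrect


before = discrimination_training(0)
after = discrimination_training(20)
print("before:", before)
print("after: ", after)
```

Before training both tendencies are positive, so the response “hat”
may be given to either word; after training only the correct tendency
remains above zero.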

In the major revision of the principles, presented in Hull's Essentials
of Behavior (1951), the two main changes that are of importance to the
present exposition were the introduction of the concept of incentive
motivation (K) and a revision of the relationship between excitatory
tendency and the various independent variables. The concept of
incentive motivation was introduced to take account of the past history of
reward associated with the given habit, K being larger if rewards had
been large on previous occurrences of the response member and
smaller if previous rewards had been of smaller magnitude. The formal
relation between K and reward magnitude is the standard growth
function, K increasing from zero at a zero value of reward magnitude to
unity at some indefinitely large magnitude.

The new function relating excitatory tendency to the principal
constructs is

$$E = k \times H \times K \times D - I,$$

where the parameter k subsumes the effects of other variables. A
modification in the fine structure of the theory is that independent
variables other than number of practice trials are now assumed to influence
excitatory tendency directly rather than by way of variation in habit
strength.


More recently Spence (1956) concluded on the basis of various types
of animal learning data that K should enter in a slightly different way,
namely

$$E = k \times H(D + K).$$

A consequence of this change, if the formalism is taken literally, is that
as training on a given task proceeds, performance becomes less
dependent upon drive level than in the original theory. In Hull's formulation,
regardless of the stage of training, if the value of D approaches zero, so
also does the value of E, and the response must cease to occur. In
Spence's revision, since incentive motivation is presumed to increase


2. interpretation of individual differences in learning ability 
Now, how should we expect individual differences in learning ability
to be treated within Hull's system? Hull addressed this problem
explicitly on only one occasion, and that following the presentation of
his system in Principles of Behavior. In 1945 Hull sketched his view of
the role of individual differences in a theory of behavior. In brief, his
conception was that individuals differ, not in the presence or absence of
various factors or mechanisms, but only in the values of the parameters
representing relative weights of different factors or rates of growth of
the various theoretical quantities. Thus any learner, whether normal,
superior, or mentally retarded, should be expected to learn in accord with
all the assumptions of the system, and a representation of his learning
process would include all of the concepts, E, H, D, and so forth.

A major locus of individual differences in learning ability is the set of 
parameters in the function relating the growth of habit to the various 
independent variables of Hull’s system. Thus one would expect to find 
a family of curves relating the growth of H to number of practice trials, 
with the curve rising most steeply for superior learners and most 
slowly for the severely retarded. 

Major questions that remain to be answered by appropriate research
are whether variation in the various parameters is correlated or
independent and whether variations in learning ability are reflected
primarily in rate constants or in asymptote constants.

Each of these questions has major ramifications. Regarding the first,
it is conceivable that mental retardation might turn out to be
associated with a deficit reflected in a very small value of some one
parameter, as for example the rate parameter in the equation relating H to N.
On the other hand, it might turn out that the parameters vary
independently from one individual to another and that the retarded learner is
one who happens to have an unduly large number of low parameter
values.

The answers to these questions are especially important with regard
to implications for training and other remedial procedures. If it were
found that the mentally retarded differed from normals primarily in
rate constants, the implication would be that deficits of learning ability
could be compensated by sufficiently great increases in amount of
training or practice. If deficits proved to be associated with asymptote
parameters, such as M in the function for growth of habit strength, the
implication would be that the limits of learning are lower for the
retarded than for normals and that the most one can hope for is to bring
the retarded individual to his maximum performance as rapidly as
possible. In either event, it might be found that, within an individual, all
parameter values tend to be correlated, so that if rate of growth of one
habit is unusually low so also will be the rate of growth of others, or it
might be the case that parameters relating to different types of habits
(as, for example, verbal versus nonverbal) are unrelated. It should be
especially emphasized that the theory does not purport to deal
with these questions but only provides a framework for research which
might yield answers to them. 
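
The contrast between deficits in rate constants and deficits in
asymptote constants can be made concrete with a short Python sketch.
It assumes, for illustration only, the growth form
H = M(1 - exp(-rN)); the criterion level of 0.8 and the parameter
values assigned to the three hypothetical learners are arbitrary.

```python
import math


def trials_to_reach(target, asymptote, rate):
    """Practice trials needed for H = M * (1 - exp(-rate * N)) to reach target.

    Returns None when the target lies at or above the asymptote M, in
    which case no amount of practice suffices.
    """
    if target >= asymptote:
        return None
    return math.ceil(-math.log(1.0 - target / asymptote) / rate)


# Three hypothetical learners and a criterion habit strength of 0.8.
typical = trials_to_reach(0.8, asymptote=1.0, rate=0.20)
slow_rate = trials_to_reach(0.8, asymptote=1.0, rate=0.05)
low_limit = trials_to_reach(0.8, asymptote=0.7, rate=0.20)

print("typical learner:      ", typical)
print("low rate constant:    ", slow_rate)
print("low asymptote constant:", low_limit)
```

Under these assumptions the learner with a low rate constant reaches
criterion with additional practice, whereas the learner with a low
asymptote never reaches it, which is the distinction drawn above with
regard to remedial procedures.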

Another major possibility which can be meaningfully expressed
within Hull's system is that the mentally retarded differ from normals,
not with regard to rates or limits of habit acquisition, but with respect
to other factors which determine performance. To the extent that this
state of affairs obtains, considerable analysis may be needed to
determine just how variation between individuals in particular factors would
affect skilled performance. To take one example, the slopes of stimulus
generalization gradients might well vary among individuals. A
shallower generalization gradient would imply greater transfer of learning
from a training task to a slightly different test situation, and this
transfer might be thought to be advantageous in most educational
situations. However, as illustrated in the example of the vocabulary item
discussed above, generalization can also lead to errors. Thus, while it
might be more adaptive for the organism to have a shallow
generalization gradient with respect to the acquisition of fears, it would probably
be more efficient to have steep generalization gradients in situations
involving discrimination learning.


With regard to inhibition, one's first thought might be that, since
inhibition tends to counteract habit strength in the determination of
excitatory tendencies, a low level of inhibition would tend to accompany
superior performance. However, in many situations the acquisition of a
skill involves the overcoming of errors, which in Hull's system would
be represented by the buildup of inhibition associated with undesired
habits. Thus there would seem to be no sound reason for ex... level of
the inhibitory variable between ...

In Hull's system, in contrast to classical associationism, there
are clear reasons to consider the possibility that large differences
between individuals in apparent learning ability might actually result
from performance factors unrelated to learning in the usual sense. The
most obvious candidate for this role would be nonspecific drive level D,
which combines multiplicatively with habit strength to determine
performance. No amount of training can be expected to lead to major
changes in performance on any given type of task if the D level is so
low that excitatory tendencies do not exceed threshold. But even more
important than the nonspecific effect of D on activity level is its
presumed function in discrimination and selective learning.

According to Hull's conception, any situation involving problem
solving or discrimination learning necessarily also involves alternative
responses on the part of the individual and thus a number of
competing habits, some of which are correct and some incorrect from the
standpoint of the requirements of a particular task. Within the theory,
if some habit strength exists for the correct response and some habit
strength for one or more incorrect responses to a given situation, the
probability of the correct response depends not only on these habit 
strengths but also on the drive level. Since the difference in habit 
strengths would be multiplied by D, whether or not a given difference 
in habit strengths will yield a suprathreshold difference in excitatory 
tendency depends on the current drive level. 

However, the role of D in the acquisition of skills is double edged,
for if at some stage in training habit strength of some incorrect re- 
sponse is greater than that for the correct response, amplification of 
the difference in habit strengths by a high D level will simply increase 
the probability of an error. Thus, individual differences in D should be 
expected to lead, not to an across-the-board difference in skilled per- 
formance, but rather to a pattern of differences changing with stage of 
training. 
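
The double-edged role of D can be illustrated with a brief Python
sketch. Only the multiplication of the difference in habit strengths by
D comes from the text; the rule converting that difference into a
response probability (a normal probability function with an arbitrary
variability parameter noise_sd) and the habit values are assumptions
made for the purpose of illustration.

```python
import math


def normal_cdf(x):
    """Standard normal cumulative probability."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def p_correct(h_correct, h_incorrect, drive, noise_sd=0.25):
    """Probability that the correct response prevails over one competitor.

    The difference in habit strengths is multiplied by drive D, then
    compared with (assumed) normally distributed variability.
    """
    difference = drive * (h_correct - h_incorrect)
    return normal_cdf(difference / noise_sd)


# Early in training the incorrect habit is assumed to be the stronger one.
early_low_d = p_correct(0.2, 0.4, drive=1.0)
early_high_d = p_correct(0.2, 0.4, drive=2.0)

# Late in training the correct habit is assumed to be the stronger one.
late_low_d = p_correct(0.8, 0.4, drive=1.0)
late_high_d = p_correct(0.8, 0.4, drive=2.0)

print(f"early: D=1 -> {early_low_d:.3f}, D=2 -> {early_high_d:.3f}")
print(f"late:  D=1 -> {late_low_d:.3f}, D=2 -> {late_high_d:.3f}")
```

In this sketch a higher drive level lowers the probability of a correct
response early in training and raises it late in training, so that the
effect of a difference in D changes sign with stage of training.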

Incentive motivation K might be an even more important variable
than D for the investigator concerned with individual differences in
mental development. Since the value of K, unlike that of D, is assumed
to grow as a function of rewarded trials in a learning situation,
individual differences in level of K would be less likely to yield a mixture
of positive and negative effects on the acquisition of skills. At the
beginning of training on any new problem, when habit strength for errors
might be greater than that for correct responses, the level of K would
presumably be low, and so also would be its effect as an amplifier of
these differences in habit strength. At a later stage in training, when
habit strength for the correct response would normally be greater than
that of any particular incorrect response, an increased level of K would
have only beneficial effects since, as a multiplier of the difference of
habit strengths, larger values of K would yield larger probabilities of
correct responding.

Thus individual differences which resulted in some individuals
having lower rates of growth or lower asymptotes of incentive
motivation could have widespread effects on the acquisition of skilled
performance. In fact, since different habits must be assumed to have
different growth rates within any individual regardless of his overall level,
it seems quite likely that individual differences in the growth rates or
asymptotes of incentive motivation might be a much more likely source
of broad deficits (or the converse) in apparent learning ability than
individual differences in rates or limits of habit growth.

Another reason why the variable K might prove an especially
attractive focus of research lies in the possibilities of overcoming
performance deficits by remedial or therapeutic measures. To the extent that
mental retardation is a matter of individual differences in parameters
of habit growth, it might well be that little could be done to overcome
deficits by modifying training procedures. However, there would be far
more reason to think that much could be done to overcome or
compensate for differences in levels of incentive motivation. One might expect
that investigators working within the framework of Hull's theory would
direct major efforts toward unraveling the extent to which individual
differences in intellectual functioning are attributable to habit, drive,
and incentive variables.


Now, to what extent have the potentialities of Hull's theory begun to
be exploited in the directing of research on mental development? In
order to evaluate progress in this respect, it will be useful to consider
separately the relatively direct applications of the Hull-Spence theory
and theoretical formulations of a general or special character which
represent outgrowths or extensions of Hull's system.


3. direct applications of the Hull-Spence theory to research 
on learning ability 


The initial attempts to apply Hull’s concepts to the analysis of 
mental retardation have been discussed in a systematic review by 
Cantor (1963), to which readers may also refer for a summary of the 
Hull-Spence system somewhat more comprehensive than the one 
given above. 

With respect to the most direct and straightforward application of
Hull's interpretation of individual differences, namely the
determination of differences between normal and retarded subjects in
parameters of the habit-growth curve, Cantor's review discloses but a single
example up to 1963. The study in question was one by Ellis, Pryer, and
Barnett (1960), who reanalyzed data from some previously published
experiments on serial verbal learning by groups of mentally retarded
individuals (averaging about 17 years of age with a mean IQ of about
64) and normals of about the same age (with mean IQs of
approximately 103). With Hull's concept of reinforcement ...
situation as occurrence of the correct response ...
of results, probability of correct response was plotted versus number of
previous reinforcements for a pooled group of 118 defectives and a 
group of 75 normals. The curves were well fitted by Hull’s habit-growth 
equation with a suitable choice of parameter values, and it was found 
that the curve for the normals required both a higher rate parameter 
and a higher asymptote parameter than that for the defectives. Thus 
the result is compatible with the conclusion that the normal subjects 
differed from the retardates in both the rates and limits of habit acqui- 
sition. However, interpretation of the result is complicated by factors 
which perhaps help explain why there have been few similar studies. 

As indicated by our previous analysis, one cannot actually determine 
the constants of the habit-growth curve from empirical learning curves 
of this sort, since the response measure, in this case, probability of 
correct responding, is a function of excitatory tendency which is in 
turn a function of drive and incentive factors as well as habit strength. 
The extent to which, for example, the observed difference between the 
two mean empirical curves could be a function of differences in drive 
level cannot be ascertained from these data. 
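
The difficulty can be shown in a few lines of Python. The sketch
assumes, for illustration, that the response measure depends on
E = D x M(1 - exp(-rN)); the two hypothetical accounts of a lower
empirical curve, one placing the deficit in the habit asymptote M and
the other in drive level D, use arbitrary parameter values.

```python
import math


def predicted_e(n_trials, asymptote, rate, drive):
    """Excitatory tendency E = D * M * (1 - exp(-rate * N)), inhibition omitted."""
    return drive * asymptote * (1.0 - math.exp(-rate * n_trials))


trials = range(0, 50, 5)

# Account A: the lower curve reflects a lower habit asymptote, drive equal.
account_a = [predicted_e(n, asymptote=0.6, rate=0.1, drive=1.0) for n in trials]

# Account B: habit asymptote is normal, but drive level is lower.
account_b = [predicted_e(n, asymptote=1.0, rate=0.1, drive=0.6) for n in trials]

largest_gap = max(abs(a - b) for a, b in zip(account_a, account_b))
print("largest difference between the two accounts:", largest_gap)
```

Because M and D enter only as a product, the two accounts predict the
same curve, and a single empirical learning curve cannot distinguish
between them.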

In order to get further insight into the theoretical locus of differences 


between the performance curves of the normal and retarded learners, 
it would be necessary to do studies of this sort, not in isolation, but as 
part of a series of experiments in which different variables were manip- 
ulated. It should be possible, for example, by suitable manipulation of 
the incentive conditions to determine the extent to which differences 
between the performance curves should be attributed to drive or
incentive-motivational differences. Further relevant information could be
obtained by using a paired-associate task and, after the learning func- 
tions have become asymptotic, interchanging stimulus-response as- 
signments among different items. At the outset of the second phase, 
the dominant habit with respect to each item would be one involving 
an incorrect response, and any difference in drive level in favor of the
normals would be expected to increase their probability of errors
rather than to increase their probability of correct responding. The
system has substantial resources for experimental manipulations of
this sort which might be expected to be of considerable diagnostic
value but which have not yet been exploited in relation to research on
retardation.

Most of the applications of the Hull-Spence theory in the area of 
retardation have attempted, not actually to determine the parameters 
of theoretical functions, but rather to make comparisons between 
groups of normal and retarded subjects on single response measures
which might reflect differences in theoretically significant variables,
provided all of the relevant factors have been held constant in a
comparison. With regard to stimulus generalization, Cantor's (1963)
review includes but a single study. Barnett (1958) adapted his
experimental situation from one used originally by Brown, Bilodeau,
and Baron (1951). The subjects were required to learn to respond to
the middle light of a row of 11 by operating a response button.
Following a series of training trials, the subjects were given test
trials involving single presentations of all the lights in the display.
Generalization gradients were obtained, with the tendency to respond
to test lights decreasing as a function of distance from the training
light in the fashion qualitatively implied by Hull’s theory (or any
other) but with the forms of these gradients very similar for the
normal and retarded groups. Little can be concluded from this one set
of data, of course, especially since the stimulus dimension involved
was position and it is well known that speed of learning simple
positional discriminations differs little between normal and
moderately retarded individuals.


Within the framework of Hull's theory, there is perhaps no special
motivation for pursuing this matter intensively. However, it might be
noted that in a different context, Zeaman and House (1963) have
postulated that a major difference between retarded and normal learners
may be localized in probabilities of attending to different stimulus
dimensions. If their idea is correct, it might be expected that
generalization functions on some stimulus dimensions would differ in
slope as a function of learning ability. If, for example, retarded
children on the average had lower probabilities of attending to color
than normals, they would presumably be less likely to exhibit a
generalization decrement when a test stimulus differed in color from a
training stimulus and thus would exhibit shallower generalization
gradients on that dimension even though they might not differ from
normals on generalization for position.

Applications of the concept of inhibition are complicated by a number
of factors, not the least of which is a sharp difference in definition of
this concept between Hull’s original formulation (1943) and the later
major reformulation of the theory by Spence (1956) which has
influenced many applications of the system in various areas, including
mental retardation. According to Hull, inhibition is a direct
consequence of responding, the amount of inhibition accruing per response
being a function of the amount of work involved in the given effector
activity. Inhibition is assumed to be built up by a constant increment
upon each occurrence of the given response and then to die away expo- 
nentially to zero as a function of time between responses. Thus there is 
room for individual differences both in the amount of inhibition gener- 
ated per response and in the rate of decay. 
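
The two parameters just distinguished can be sketched numerically. The following is a minimal illustration assuming a constant increment per response and exponential decay between responses; the function name, the response schedules, and the parameter values are all hypothetical.

```python
import math

def inhibition_after(response_times, increment, decay_rate):
    """Inhibition level immediately after the last response.

    Each response adds a constant increment; between responses the
    accumulated inhibition decays exponentially toward zero.
    """
    level = 0.0
    previous = None
    for t in response_times:
        if previous is not None:
            level *= math.exp(-decay_rate * (t - previous))
        level += increment
        previous = t
    return level

# Hypothetical schedules (times in seconds).
massed = [0, 1, 2, 3, 4]
spaced = [0, 10, 20, 30, 40]

massed_level = inhibition_after(massed, increment=1.0, decay_rate=0.5)
spaced_level = inhibition_after(spaced, increment=1.0, decay_rate=0.5)
```

Under these assumptions inhibition accumulates under massed responding but remains close to a single increment when responses are widely spaced. Varying `increment` and `decay_rate` separately shows how each could contribute to individual differences.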

It is not entirely obvious just how differences in learning ability 
should be related to these parameters. On the one hand, a higher 
buildup of inhibition per response would lead to a decrement in re- 
sponding even during continued reinforcement and thus it would be 
deleterious to performance in many kinds of routine tasks, for exam- 
ple, the operation of a station on an assembly line. However, this factor 
might be interpreted as a performance variable unrelated to intelli- 
gence or learning as these attributes are usually conceived. 

In many situations, particularly those which require the learner to 
find a correct response by a process of trial and error, a greater incre- 
ment in inhibition per response might be adaptive since it would lead 
to more rapid weakening of initially strong but incorrect responses and 
thus facilitate the emergence of correct behavior. If an increased rate 
of buildup of inhibition were compensated by an increased decay rate, 
the individual having this combination would not be penalized on re- 
petitive tasks except under conditions requiring high rates of repetition 
of the same response. Some studies comparing performance of normal 
and retarded individuals on simple tasks provide evidence of just such 
a pattern. For example, Siegel and Foshee (1960) compared groups of 
normals and defectives on a task which allowed the subject to extin- 
guish the signal light each time it appeared by pressing any one of four 
switches. The defective group showed less response variability from 
trial to trial: this result, taken at face value, might suggest a smaller 
buildup of inhibition per response for the subjects of lower
intelligence. A complication of this study, however, was the much
higher average chronological age of the defectives.

In a study reported by Ellis (1970) defective and normal individuals
of about the same chronological age were compared on a pursuit-rotor
task involving blocks of massed trials separated by a rest period and
additional training either 1 or 28 days following initial training. The
greater gains in performance over the longer rest periods for the
normal subjects led to the conclusion that they must have built up
greater amounts of inhibition per response during the massed-trial
sessions.



A similar result was reported by Jones and Ellis (cited in Cantor, 
1963) although the latter study failed to confirm the associated
prediction that the normal and defective subjects should have been
differentially affected by the degree of distribution of practice
during the work sessions. Possibly, however, the prediction that the
two groups should have differed in performance during training sessions
as a function of distribution of practice is not well founded since, as
we have indicated
above, normal and defective subjects might well differ on both the 
rates of increment and the rates of decay of inhibition. There seem to 
be no results in the literature which would enable one to tease out 
these aspects separately. 

In Spence’s formulation (1956) the effects of work are not given
explicit attention and the notion of inhibition is embodied in two
concepts, the inhibition resulting from nonreward (and presumably related
to the commonsense notion of frustration) and the inhibition of delay
observed in certain Pavlovian experiments. Cantor (1963) feels that
applications of these concepts to problems of mental development
might be more fruitful than the use of Hull's conceptualization, but
notes that almost no relevant work in that direction had been
accomplished.

Very little work seems to have been directed [...] working out the way
in which parameters [...] determiners of habit growth vary [...] by
Jacobs (1950) compared institution[...] and institutionalized
defectives [...] beneficial to performance in all groups, the absence
of interactions indicated that the effect of delay was similar in all
groups.

Studies of incentive motivation have been rather exploratory in
character and lead primarily to the impression that little is yet
understood concerning individual differences in preferences for
different incentives and the ways in which these can be manipulated.
A study of the effects of different reward magnitudes [...]-choice
discrimination learning by defectives reported [...] information
concerning the relation of this [...] individual differences in
learning ability, but does confirm [...] from Hull's theory [...]
incentive motivation should be manifest [...] when correct responses
become dominant (that is, at the higher habit strengths).


Research directed toward evaluating the role of differences in drive
level in learning of individuals of different ability has to date been
primarily limited to assessment of anxiety level as measured by the
Taylor manifest-anxiety scale. A major and largely unsolved problem 
in this line of research is the difficulty that the responses to the Taylor 
scale must necessarily involve reading comprehension to an important 
degree and thus can scarcely be independent of intelligence level. In 
fact, some research has indicated that scores on this type of test may 
be as importantly related to IQ as to emotional or drive level as these 
are usually conceived. Following a review of the literature in this area 
Cantor (1963) suggested that it is unlikely that defectives as a group 
differ greatly from normals with regard to anxiety but at the same time 
proposed that more effort needs be directed toward determining 
whether high levels of anxiety might be an important source of interfer- 
ence with learning in the mentally defective. 

A notable deficiency in Hull's system is the absence of any mecha- 
nism for handling phenomena of retention in comparable detail to 
those of acquisition and the associated absence of any conceptual 
basis for differentiating short- and long-term memory. 

The potential usefulness of Hull’s system for research on mental 
ability is sharply limited by the fact that this theory is based primarily 
on animal learning data and has been interpreted in only an extremely 
narrow range of situations involving verbal and conceptual behaviors of 
human beings. Consequently, it is to be expected that other theories 
will have to be called upon to guide research of the types generally 
considered to be of most practical importance in connection with 
mental development, especially that having to do with the acquisition 
and use of language. Perhaps the greatest potentiality of the theory lies 
in its function as a conceptual tool for analyzing the roles of learning 
processes and motivational variables in the determination of
performance.


VIII. Ellis’s Stimulus-Trace Theory 


[...] example of a limited theory of mental deficiency [...] basis of
a single concept drawn from Hull's theory is Ellis’s stimulus-trace
hypothesis (1963). Ellis’s idea appears to be that [...] normal and
mentally defective individuals [...] performance ordinarily thought of
as having [...] animal experiments rarely exhibit appreciable [...]
material learned in the laboratory. [...] the common use of digit-span
[...] intelligence test batteries and similar [...] means of detecting
“intellectual” [...] the connection with Hull’s theory seems largely to
have been lost and the concept has been redefined as a short-term [...]

1. background and assumptions of the 1963 formulation


The essentials of the original concept were that, following the
occurrence of any stimulus, a trace, s_t, persists in the nervous system for a
period of the order of a few seconds during which it can provide the 
basis for a delayed response and can become conditioned to the re- 
sponses evoked by other stimuli. The magnitude and rate of the decay 
of the trace were assumed by Ellis to be a function of “central nervous 
system integrity.” Thus central nervous system defects of a sort to
produce defects in learning ability were assumed to be associated with 
reductions in the magnitude of the trace from the normal level and also 
with an increase in the rate of decay, so that the duration of control of 
behavior by any given stimulus would be less for the subnormal than 
for the normal individual. 
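
The joint effect of a reduced magnitude and an increased decay rate on the duration of stimulus control can be sketched as follows. The exponential decay form, the response threshold, and all parameter values are assumptions introduced only for illustration; the source specifies only that magnitude and rate of decay vary.

```python
import math

def trace_strength(t, magnitude, decay_rate):
    """Trace strength t seconds after the stimulus (assumed exponential decay)."""
    return magnitude * math.exp(-decay_rate * t)

def effective_duration(magnitude, decay_rate, threshold):
    """Seconds until the trace falls to a hypothetical response threshold."""
    if magnitude <= threshold:
        return 0.0
    return math.log(magnitude / threshold) / decay_rate

# Hypothetical values: the second individual has a weaker, faster-decaying trace.
normal_duration = effective_duration(magnitude=1.0, decay_rate=0.5, threshold=0.1)
subnormal_duration = effective_duration(magnitude=0.7, decay_rate=0.8, threshold=0.1)
```

Under these assumptions the second individual's trace drops below threshold sooner, so that a delay easily bridged by the first would exceed the period of stimulus control for the second.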

Ellis took a number of types of experimental evidence as support for 
the idea that individual differences in the stimulus trace are important 
determiners of performance on learning tasks. One of these had to do 
with the serial-position curve in verbal learning. Although some early 
research (e.g., McCrary & Hunter, 1953) suggested that the form of 
the serial-position curve, with the maximum occurrence of errors near 
the middle of a list, was unrelated to ability levels, some later studies 
[including those of Noble and Fuchs (1959) and Barnett, Ellis, and 
Pryer (1960)] indicated that faster learners had a relatively greater 
advantage on items near the beginning or end of the list and made
disproportionately more errors on the middle items.

This last finding conforms nicely to expectation based on Hull's in- 
terpretation of the form of the serial-position curve. Stimulus traces 
should, on the one hand, permit responses from early items to persev- 
erate and cause errors on later items; but also these traces, dimin- 
ishing in magnitude as a function of time following occurrence of an 
item, should cause the effects of reward at the end of the list to exert 
graded effects on items in relation to their proximity to the end. There- 
fore, of two otherwise similar individuals, the one with the stronger or 
more persistent stimulus trace would be predicted to conform more 
closely to the pattern found for the superior learners in these latter 
studies.

One difficulty with this source of support is that other more recent 
studies (e.g., Girardeau & Ellis, 1964) have not found the predicted 
differences. Another difficulty is that factors other than the stimulus
[...] which can serve to bridge
the delay interval rather than in some simple property of the nervous
system as implied by the stimulus-trace interpretation. 

Perhaps the most direct bit of evidence providing support for Ellis’s
original hypothesis was the finding by Berkson, Hermelin, and
O'Connor (1961) that the duration of blocking of the alpha rhythm
following presentation of a visual stimulus was significantly greater in
normal than in mentally defective individuals. There is no logical
connection between electroencephalographic data and the concept of
stimulus trace as defined in Hull's system, but one must concede that
the finding of differential durations of alpha blocking is suggestive of a
relationship between intellectual level and some type of trace activity
which persists following stimulation.

2. tests and implications of the trace theory 


In any event, the nature of the evidence which suggested the pos- 
sible importance of a stimulus-trace process in the analysis of learning 
deficits is not of great import. As Ellis indicated (1963, p. 145), the pri- 
mary test of this concept should have to do with its fruitfulness in 
yielding predictions regarding such matters as the relative difficulty of 
different types of learning problems between normal and defective in- 
dividuals. To this end Ellis and his associates have conducted a 
number of studies which test implications of the trace hypothesis. One 
of these was a study by Ellis and Zimmerman (cited in Ellis, 1963, p. 
147) in which a modified delayed-response situation was utilized. Pre- 
sentation of a colored light in the single upper window of a stimulus 
panel initiated a trial. Following a delay interval a row of lights of
different colors appeared in windows at the bottom of the panel. The
subject’s task was to select the light from this set which matched in
color the sample that had appeared at the beginning of the trial. As
anticipated, performance declined as a reasonably orderly function of delay
interval in a group of mentally defective adult subjects, and there was 
a modest relationship between MA and performance at the longer 
delay intervals. 

However, it should be noted that over 10 days of training there was 
improvement in performance on the part of all of the subjects, and 
rather drastic improvement in some cases, so that by the tenth day
some subjects’ performance was approaching constancy over the range 
of delay intervals used. This finding does not particularly support the 
idea that the initially steep decrements in performance over delay in- 
terval were attributable to individual differences in a stimulus-trace 
parameter representing some relatively stable aspect of the nervous 
system. On the other hand, it fits quite well with the suggestion
mentioned above that individual differences in delayed performance
depend to a major extent on corresponding differences in activities such
as verbal rehearsal. 

A rather more direct test of the trace hypothesis was undertaken in
a study by Terrell (cited in Ellis, 1963, p. 148) involving a simple
reaction-time task with differing intervals between a warning signal
and the [...] instructed to respond. [...] remained on until the
reaction [...] leaving an interval which would have to be bridged by
some trace process on the part of the subject.

As expected, the reaction-time [...] for a group of normal subjects
declined only slightly as a function of the duration of the warning
interval and was virtually unaffected by the on-off versus persisting
warning signal conditions. Reaction time for the mentally defective
group was higher at all values of the [...] intervals and, more
importantly, was much higher [...] persisting warning signal condition.
[...] result confirms Ellis's expectation [...] hypothesis. However,
these data should perhaps [...] caution in view of the unusual finding
that reaction [...] function of the interval between the warning [...]
reaction-time stimulus over the range 2-4 sec, in contrast [...] usual
result of an optimal interval in the neighborhood of 2 sec.


A rather direct implication of the stimulus-trace hypothesis is that
[...] well-controlled learning [...] in determining [...] defectives.
Ellis cites [...] far to fit in quite well with his interpretation.
Vergason (1964) [...] and retarded children, 12-16 [...] conditions in
which the stimulus [...] members of the paired-associate item appeared
simultaneously [...] conditions in which there was a delay of 2.75 sec
[...] stimulus and response [...] deserve further investigation since
it has generally been found that paired-associate [...] relatively
independent of intratrial time interval [...] values.



3. reformulation in terms of memory processes


On the whole, tests of the original formulation were encouraging for 
the idea that a specific short-term memory deficit might characterize 
the mentally defective, but not for an identification of the deficit specif- 
ically with the stimulus-trace concept as developed in Hull’s system. 
During the same period in which the work discussed above was con- 
ducted, a major acceleration of investigative activity concerning short- 
term memory appeared in the work of Broadbent (1958) and Peterson 
(1963), among others. The similarities in methods and results of these 
investigations and those of some of the studies designed to test the 
trace hypothesis became apparent and suggested the possible fruitful- 
ness of continuing the latter line of research within the theoretical 
framework developed by Waugh and Norman (1965) and Atkinson and
Shiffrin (1968).
In general outline, this family of theories analyzes the learning of
verbal materials in terms of operations of information storage and
retrieval without attempting to reduce these to stimulus-response terms.
Thus this theoretical framework takes a form much like the flow
diagram of a computer program. The first stage involved in learning any
particular set of material is conceived to be an attentional process, the
term being used in much the same sense as in the theory of Zeaman
and House discussed below. Stimulus items attended to are assumed
to be automatically registered in an immediate-memory system from
which information is rapidly lost as a function of time, all traces of the
input being lost within a few seconds unless some mental activity on
the part of the learner transfers the material to a long-term
memory-storage system. Ellis (1970) proposes distinguishing three phases in
the storage process rather than the two (short- and long-term storage)
assumed by Waugh and Norman, and by Atkinson and Shiffrin. In
Ellis's schema the first phase of storage is termed primary memory and
has the properties indicated above for short-term storage in the other
theories. Primary memory material may be transferred to secondary
memory and then to tertiary memory.

Secondary memory is assumed also to be a transient phase, but with
a slower decay rate than the primary memory system; presumably it is
responsible for retention over intervals of a few seconds, as for
example those involved in digit-span tests. Tertiary memory corresponds
to the notion of long-term memory storage in the other theories.
Transitions from primary to secondary and from secondary to tertiary
[...]

[...] window, thus activating an electrical contact leading to the recording
system. 

Presumably an individual's ability to locate the position where the 
test or “probe” digit appeared should reflect short-term memory rela- 
tively independently of recall processes and should permit the inde- 
pendent investigation of exposure durations, time intervals between 
exposures, and time between the final exposure and the test. Prelimi- 
nary results reported by Ellis and Hope (1968) suggest that these objec- 
tives are quite well met. Their data on college students indicated that 
rate of presentation mainly affects memory for the early and middle 
positions in the series but not for the terminal positions, and that expo- 
sure duration and interval between exposures are approximately inter- 
changeable in their effects. 

The obvious interpretation in terms of the theoretical schema would
be that increasing the presentation rate reduces opportunities for re- 
hearsal and thus reduces the transitions from primary to secondary 
memory that are needed for effective retention of the earlier positions 
in an exposure series. A delay between the last digit of a series and the 
presentation of the test stimulus primarily affects accuracy on the last 
exposure position, as would be expected if the perfect retention of the 
final stimulus normally observed at short test intervals is principally
attributable to the primary-memory system.
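
The interpretation just given can be illustrated by a toy model. It is not Ellis's formulation: the exponential forms, the rule that rehearsal time for an item equals the interval before the next exposure, the assumption of no loss from secondary memory, and all parameter values are hypothetical choices made only to show the qualitative pattern.

```python
import math

def recall_by_position(n_items, interval, delay, primary_decay, rehearsal_rate):
    """Probability of correct recall at each serial position (toy model).

    An item is recalled if it is still in primary memory at test or if it
    was transferred to secondary memory by rehearsal before the next item.
    """
    probs = []
    for position in range(n_items):
        is_last = position == n_items - 1
        # Time elapsed between this item's exposure and the test.
        age = (n_items - 1 - position) * interval + delay
        in_primary = math.exp(-primary_decay * age)
        # Time assumed available for rehearsing this item.
        rehearsal_time = delay if is_last else interval
        in_secondary = 1.0 - math.exp(-rehearsal_rate * rehearsal_time)
        probs.append(1.0 - (1.0 - in_primary) * (1.0 - in_secondary))
    return probs

# Hypothetical learners differing only in rehearsal rate, at two exposure intervals.
fast_high = recall_by_position(9, 0.5, 0.0, primary_decay=1.0, rehearsal_rate=0.5)
slow_high = recall_by_position(9, 2.0, 0.0, primary_decay=1.0, rehearsal_rate=0.5)
fast_low = recall_by_position(9, 0.5, 0.0, primary_decay=1.0, rehearsal_rate=0.02)
slow_low = recall_by_position(9, 2.0, 0.0, primary_decay=1.0, rehearsal_rate=0.02)
```

In this sketch slowing the presentation rate raises accuracy at the first position substantially for the learner who rehearses and hardly at all for the learner who does not, while accuracy at the final position on an immediate test is perfect in every case.
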

Comparisons between performances of normal and retarded individuals
in this situation have yielded a striking pattern of results, and one
of special interest in that very similar performance of normals and 
defectives and very large differences are both observed in the same 
situation (Ellis, 1970). In the first of a series of studies, college-student 
subjects were compared with a group of retardates of about the same 
age whose mean IQ was in the neighborhood of 60. Sequences of trials
were run with [...] intervals between stimulus presentations
and with an immediate test at the end of each exposure series. The
test data revealed large differences between the normal and retarded
subjects at both exposure rates. However, the pattern of the
differences is of more interest than the magnitude. When serial-position
curves, that is the proportion of correct responses at each of the nine
positions in the display, were plotted, very large differences appeared
between normals and retardates at the early positions but no difference
at the last position. These results seem rather negative for the earlier
stimulus-trace notion, since memory for the final position should yield
the data most closely reflecting a simple trace process.

In terms of the revised conceptual schema, one must conclude that
normals and defectives are strikingly similar in primary memory. The
principal question is whether the large differences observed at the
earlier positions can be localized in the secondary-memory phase of the
general model. Since the normal subjects proved to be considerably
more influenced by presentation rate than defective subjects, and
since the differences appear largely at the earlier positions in the
sequential display, Ellis (1970) has proposed a “rehearsal strategy
deficiency hypothesis” as a successor to his earlier stimulus-trace
hypothesis. The essentials of the new hypothesis are that memory
capacities, at least in the primary and tertiary phases envisaged in
his schema, do not differ appreciably between normal and retarded
individuals, but the retardates are characterized by a deficiency in
the rehearsal strategies which are required to transfer information
from primary to secondary and secondary to tertiary memory storage.

Some aspects of the new hypothesis have received considerable 
support from additional experiments in the serial-exposure situation 
(Ellis, 1970). When a group of retarded individuals was run for 12 
days in the same basic situation but with .5-, 1-, and 2-sec exposure
rates for different blocks of trials, [...]able appeared. There was a
learning effect over the 12 days of experience, and it is perhaps
significant that the improvement appeared only [...] positions of the
serial display, which would be compatible with the notion that the
progressive improvement represented some modest success in improving
rehearsal strategies.

When exposure rate was held constant and delays of 0, 5, 10, or 15
sec were interposed between the final stimulus of the exposure series
and the test stimulus, the recency effect (that is accuracy at the last
stimulus position) was found to decrease considerably more as a
function of delay for retarded than for normal subjects. For neither
group was there any systematic trend in accuracy at the earlier
positions as a function of delay, tending to confirm the [...] at the
earlier positions reflects [...] system which can be quite sharply
[...] from the very transitory storage which is reflected most clearly
in performance at the final position.



The general conclusions from this series of studies would appear to
be, firstly, that the retarded individuals have a substantial deficit in the
ability to retain information from stimulus displays over intervals of the
order of 0-20 sec. Further, when the display is presented serially, the
principal deficit for the retardates appears in the earlier positions.
Variation in presentation rate produces large effects at the earlier
positions for normals but little or no effect for retardates, suggesting that
the normal individuals take advantage of slower presentation rates to
[...] activities which improve retention of information from
earlier presentations of the sequence. These quantitative results,
together with observations of retardate performance reported by
[...] that the retarded subjects engage in very little rehearsal
activity.

5. comment 


Interpretation of these findings in terms of the theoretical schema 
raises numerous problems which call for further analysis. Firstly, con- 
sider the data regarding retention of information presented at the last 
serial position. It is clear that upon immediate test normals and
retarded subjects, within the range tested to date, yield equal
performance at virtually 100% retention. If a test is delayed, the retention
curve falls much more sharply for the retarded subjects. One possible 
interpretation, and the one evidently favored by Ellis, is that the first 
point on the retention curve for the final position represents primary 
memory and the remaining positions secondary memory, so that one 
might conclude that normals and retardates do not differ with respect 
to the primary-memory process. However, the data available at present 
do not clearly select this interpretation from a number of alternative 
possibilities.

Perhaps the most obvious alternative hypothesis is that the short- 
term retention function is an inverted S curve, relatively horizontal for 
a short interval following the stimulus exposure, then falling more 
steeply for a time before the transition to the final negatively acceler- 
ated portion. This possibility could be checked out by further experi- 
mentation in which shorter test intervals were used. However, it must 
be noted that no experiments of this type can localize the differences 
between individuals of different ability in a memory storage or decay 
process representing purely retention of information from the final 
stimulus of the series. For in all cases this presentation has been pre- 
ceded immediately by the others in the series and thus there is a clear 
possibility of the type of interference customarily associated with the 
notion of proactive inhibition. That is, the retention function for the 
final position may be influenced by the information input at earlier
positions. If so, the divergence between the final-position retention
curves for normal and retarded individuals might represent the emer- 
gence of proactive effects. In order to get at this possibility, it will evi- 
dently be necessary to make comparisons of retention functions for the 
final position with the number and spacing of earlier presentations in 
the sequence as an independent variable, and perhaps to investigate 
conditions in which the stimuli at earlier positions differ qualitatively 
from that at the final position. 

One particularly interesting possibility is that the differences be- 
tween normals and retardates in the terminal-position retention curve 
might be attributable entirely to the fact that the normals have stored
information from the earlier positions more effectively, so that it
interferes less with retention of material from the middle positions. In this
event the differences as a function of ability level in this situation
might be entirely reducible to differences in ability to transfer
information from short- to long-term memory storage rather than to
differences in storage capacities at any level.

The idea that differences in short-term retention as a function of
ability level may depend to a major extent upon differences in
rehearsal activity is attractive both because it is relatively open to
investigation, as distinguished from hypotheses concerning differences in
memory capacity, and because it entails possibilities of mitigating
deficiencies in performance if ways can be found to improve rehearsal
strategies on the part of defective individuals. However, before a start
can be expected in the latter direction, numerous questions need to be
answered by further research.

Apparently retarded individuals rehearse less in standard learning
situations. Is the same true of nonlaboratory situations such as those
involved in the acquisition of language or in school learning? In either
event, why does the retarded individual engage in less rehearsal? Is he
unable to rehearse effectively? If so, is this a matter primarily of an
innate limitation on his ability for such learning or is it a deficiency in
earlier learning which might somehow be overcome?

When the retarded individual does rehearse, does the rehearsal
activity produce the same effects on memory storage as the same activity
in an individual with normal learning ability? This last question is
especially critical with respect to practical implications, for there
would be no reason to engage in strenuous searches for methods of
teaching rehearsal strategies to the mentally defective unless there
were reason to believe that increased rehearsal on their part would
improve intellectual performance.


IX. Denny's Elicitation Theory 


Although the approach which Denny (1966) terms elicitation theory
is, perhaps, not a direct outgrowth of Hull's system, it fits into the
same general framework and shares both the advantages and limitations
to a considerable degree. The conceptualization of behavior is strictly
that of stimulus and response, with learning occurring by stimulus
substitution, as in the theories of Pavlov and Hull. But with respect to
the basic condition for learning, Denny follows Guthrie (1952) in
assuming that simple contiguity of stimulus and response is both
necessary and sufficient.

The principal function of reward in a trial-and-error situation, as, for
example, maze learning, is that it evokes approach to the location of
the goal. Then simply through association by contiguity, approach
responses become conditioned to other cues near the goal and the
process works backward over successive trials until exposure of the
animal to any portion of the pathway will initiate a sequence of
movements leading to the goal. The termination of pain is assumed to
exert its effects by eliciting relaxation, which in turn reduces the
likelihood of occurrence of competing responses that might otherwise
become conditioned to stimuli present in the situation at the given time
and interfere with or displace the response which led to termination of
pain.


Experimental extinction of learned habits is assumed to proceed
strictly by counter-conditioning. That is, a response which has been
learned to a certain stimulus will continue to be made upon
recurrences of the situation until some other response is learned in its
stead.

The important concept of an elicitation hierarchy has much the 
same properties as the habit family hierarchy in Hull’s theory. At any 
given time an organism is assumed to have a hierarchy of possible re- 
sponses to any given situation. Over time or over repeated trials in the 
same situation a primary process termed “satiation” is assumed to 
occur, temporarily decreasing responsiveness to the repeated stimuli 
and thus leading to temporary changes in relative strengths of
responses in the hierarchy. The shifts in the hierarchy produced by
conditioning and counterconditioning are assumed to be lasting changes.
Stimulus generalization from one situation to another is assumed to
occur in much the same manner as in Hull's theory but the importance
of stimulus sampling receives more emphasis in Denny's formulation.
On the basis of an earlier review of the literature on learning studies
involving the mentally defective (Denny, 1964) and, no doubt, of his
own more recent experimental work with severely retarded subjects,
Denny suggests that the main defects in learning on the part of the
mentally retarded are attributable to limitations on duration of
attention and inability to delay response to a stimulus, deficiencies in
capacity for symbolic behavior, and a broad deficit in inhibitory
processes. He further suggests that the last named, the inability to
inhibit responses, may in turn be responsible to a considerable extent
for the other types of deficiency. In application to problems of mental
retardation, Denny’s theory is very similar to that of Hull. It is
directly applicable primarily to the learning of simple habits. The
types of procedures and the types of analyses that would be suggested by
Denny's and Hull’s theories would seem to differ largely in matters of
emphasis, Denny giving more explicit attention to the role of
competition among alternative responses in all types of learning
situations.

On the basis of these assumptions, together with firsthand experi- 
ence in training situations, Denny proposes the following as the major 
guiding principles which need exploitation in training the mentally re- 
tarded. 


1. elicitation of the correct response


The first concern in every training situation must be to evoke the
desired response from the learner in the presence of the stimuli which
are later to signal it while at the same time minimizing the occurrence
of incorrect responses. Several means for achieving these ends have
been developed in various experimental contexts. Most important,
perhaps, is the use of “crutch” cues and “fading” techniques (Terrace,
1963a, b). To illustrate, suppose that a child is being trained to use the
words “light” and “dark” appropriately to describe stimulus cards of
different shades of gray. If discrimination is difficult, one might begin
by putting a black patch on the darker card; then, after this relatively
easy discrimination is mastered, one might gradually reduce (“fade
out”) the black patch over a series of trials by steps sufficiently small
that the learner never begins to make errors until, ultimately, correct
responses are being made to both shades of gray. Another technique
suggested is the use of physical barriers to prevent the occurrence of
incorrect responses during early stages of learning in trial-and-error
situations.
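
The fading procedure just described can be put in the form of a small
sketch. This is only an illustration under stated assumptions, not
Denny's or Terrace's procedure: the function name run_fading, the integer
"cue" scale (size of the black patch in arbitrary steps), and the
respond callback standing in for one training trial are all hypothetical.
The rule of backing up one step after an error is likewise an assumption
added for completeness; the text asks that steps be small enough that
errors do not occur at all.

```python
from typing import Callable, List


def run_fading(respond: Callable[[int], bool],
               start: int = 10,
               max_trials: int = 100) -> List[int]:
    """Return the crutch-cue size presented on each trial.

    respond(cue) is a hypothetical stand-in for a single trial; it
    returns True when the learner responds correctly at that cue size.
    """
    cue = start
    history: List[int] = []
    for _ in range(max_trials):
        history.append(cue)
        if respond(cue):
            if cue == 0:
                break  # correct with no crutch cue left: fading is done
            cue -= 1   # fade the cue by one small step
        else:
            # Assumed recovery rule: restore one step of the cue.
            cue = min(start, cue + 1)
    return history


# A learner who never errs sees the cue shrink steadily to zero.
print(run_fading(lambda cue: True, start=3))
```

A run that keeps bouncing between two adjacent cue sizes would suggest,
on this sketch, that the step at that point is too large for the learner.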


2. immediate knowledge of results 


In either learning experiments or training situations knowledge of
results following correct responses ordinarily is conveyed by means of
rewards which, among other things, indicate to the subject that his
response was appropriate. Following incorrect responses, knowledge of
results involves an indication of the error, such as saying “wrong.” In
terms of elicitation theory immediate knowledge of results following
errors is clearly important if the procedure allows the subject to make
a correction since this would insure the elicitation of the correct
response in the given stimulus situation. In a trial-and-error situation
where the task involves finding a route to a goal, rewards following
correct responses are clearly important in terms of elicitation theory
since they evoke approach responses in the vicinity of the goal, and
these responses may in turn become conditioned to more remote stimuli.
When the responses are nonspatial, and especially when they are
verbal, it seems less clear how to interpret the function of reward in
this system. To the principle of immediate knowledge of results, Denny
adds that of differential feedback. It is not entirely clear to this
reviewer how the two concepts are to be differentiated except, perhaps,
that differential feedback may serve simply to increase the number of
stimulus elements in a situation which become associated with the
desired response.


3. stimulus generalization and positive transfer 


In harmony with the ideas of Guthrie and of stimulus-sampling theory 
(Estes, 1959), Denny emphasizes the importance of increasing the 
likelihood of transfer of training to new situations by giving training 
in a variety of contexts. For example, in teaching the name of an ob- 
ject, essential aspects of the training are to elicit the correct name in 
the presence of the object in various background situations and to use 
various instances of the same object (for example, a variety of tables if 
the word “table” is being learned). Transfer is expected to increase if 
this training is spread over a number of sessions rather than being 
highly concentrated in time, since distributing the practice increases 
the likelihood of changes in stimulus context and therefore of the
association of more different stimuli with the correct response.

Further, in order to establish appropriate usage in verbal behavior, it
is important that a particular response be repeatedly associated with a 
particular stimulus while other features of the situation vary randomly. 
If, for example, a child were taught to respond to a particular com- 
mand with the training always given under precisely the same circum- 
stances, learning would appear to be rapid, but the effects of the 
learning would disappear as soon as the child was tested in another 
situation. 


4. motivation (elicitation hierarchy) 


As in all learning theories, motivation is recognized to be important, 
but in the elicitation approach the emphasis is upon external incen- 
tives which may be expected to evoke vigorous behaviors from the 
learner rather than upon internal conditions of drive or need. Variety 
of stimulus presentation is expected to have as one of its benefits a 
slowing of satiation processes which might otherwise be expected to 
lead to reduced attention to stimuli relevant to the learning task. Re- 
wards are considered important, not simply as indicators of correct 
responses within a particular segment of a learning task, but also as a 
means of evoking approach responses toward the learning situation as 
a whole. 


5. sequential building 


This undoubtedly important principle, which in Denny's presentation
is only cursorily described, relates to the strategy of beginning
with things which a given child finds it easy to learn and building by
easy steps toward more difficult goals, taking advantage of positive
transfer of training whenever possible. It would seem that a learning
theory should have much to say of a rather specific character about the
nature of sequential building, the advantages of different types of
sequences, and the like, but these problems seem not yet to have been
dealt with explicitly in the case of elicitation theory.

Only with respect to the first of these proposed conditions do
Denny's recommendations differ substantially from those that would
follow from most other classical learning theories. The use of “crutch”
cues and of fading techniques can undeniably be effective, but also the
use of these methods can involve considerable hazards. As will be
illustrated in the following section, the use of a fading technique in a
discrimination problem can result in the learner’s becoming fixated on
an incorrect response pattern. It is essential that the fading be done
in such a way that the learner is not led to attend to an irrelevant
stimulus dimension (see pp. 79-80). Further, the use of barriers,
physical or otherwise, to prevent incorrect responses during early
stages of learning seems intuitively appealing but can involve
undesirable and unexpected side effects. For example, it is well known
in maze learning experiments that an animal which has been prevented
from taking a particular pathway by means of a barrier, and thus coerced
into following a correct pathway to food reward over a series of trials,
may have extremely high probability of taking the incorrect path the
first time the barrier is removed (see, e.g., Denny, 1957; Estes &
Schoeffler, 1955; Zeaman & House, 1951). A possible solution to this
difficulty, which seems not to have been studied experimentally, is to
eliminate a barrier gradually, by analogy with the fading technique.
This possibility needs both an experimental and theoretical analysis.

Denny has applied his theoretical approach to retardate training, not
by a series of experiments in which it is compared to other theories,
but by developing a simple training situation and working intensively
with a small number of severely retarded children over a considerable
time (Denny, 1966). His teaching device is a multiple-choice apparatus
which permits simultaneous presentation of several stimuli, for
example, pictures, on a display panel and permits the child to respond
to these by operating a response bar. Correct responses are indicated by
illumination from within the knob at the subject’s end of the response
bar. At the beginning of training, the child is simply taught to respond
to simple commands such as “go to green” by moving the bar until it
touches the green stimulus patch, crutch cues being supplied by the
experimenter initially in the form of gestures or manual guidance.
Then appropriate responses to increasingly complex commands and
simple concepts having to do with color, form, spatial relations, and
the like are taught in a regimen guided by the principles given above.
Although no formal quantitative assessment is given, it appears that
considerable progress has been made in teaching a variety of simple
behaviors and concepts to several children with IQs in the range of
20-30. Denny indicates optimism regarding the future possibilities of
more expanded programs following the same principles.


X. The Social Learning Approach 


Typically the retarded individual differs from normals not only in his
cognitive functioning in laboratory experiments or in school learning
situations but also in his reactions to other people, his ability to
function in a social group, and in the dispositions, habits, and
attitudes customarily associated with character and personality. At
present it is scarcely possible even to begin analyzing the development
of personality and of social interactions in terms of the same theories
that have been applied to the more elementary cognitive processes.
Furthermore, proposed new theories at this level do not meet the same
standards of rigor and specificity of prediction. Nonetheless, it is as
essential at the more complex as at the more elementary levels to guide
and organize research by theoretical concepts.

Of the currently influential theoretical approaches to personality
development, the one most amenable to discussion in the same context
with theories of learning and memory is that which has been termed
the “social-learning approach.” The principal basis for our present
examination is the review by Cromwell (1963).


1. social learning theory


The immediate conceptual basis of the social learning approach is
Rotter’s “social learning theory” (1954). This formulation is not a
learning theory in the same sense as those of Hull or Spence but rather
a composite of concepts drawn from the theories of Lewin (1935), Hull
(1943), Miller (1959), and Tolman (1932) and applied to the
interpretation of personality development and the social functioning of
the individual.

The principal theoretical concepts drawn from learning theory are
reinforcement value and expectation. The primary assumption is that
the tendency for behavior to occur in any given situation is a function
of the individual's expectation of reinforcement in that situation and of
the value of the reinforcement. The notion of value is somewhat more
extended than that involved in the concept of reinforcement or reward
magnitude in learning theory. Reinforcement value reflects not only
the value of the given outcome relative to some need or motive but also
the individual's expectation that the reinforcement will lead in turn to
other reinforcements and of the values of these. Expectation of
reinforcement in any situation depends partly on generalization from
other experiences. Further it is assumed that behaviors are organized
into clusters or systems in such a way that different clusters of
behavior are interchangeable if they lead to the same rewards.

Thus, the social learning approach constitutes, in effect, an outline 
for a theory together with some guiding ideas as to the types of con- 
cepts and relations that may prove important. I believe it would be a 
mistake to attempt to evaluate it in terms of its present ability to lead 
to specific predictions of behavior. However, its function in guiding 
research in the area of mental retardation is of considerable interest. It 
is to this aspect that Cromwell's review gives major emphasis,
summarizing the preliminary application of the approach to a number of
studies and sketching some of the revisions in provisionally formulated 
concepts that have been suggested by these applications. 


2. research in “social learning”


Preliminary studies by Cromwell and his associates (Ringelheim,
1960; Starkman & Cromwell, 1958) explored ways in which the
mentally retarded individual's learning is affected by previous successes
and failures. Attempts to demonstrate positive or negative effects on
subsequent speed of learning proved essentially negative. Also, only 
rather inconclusive results were forthcoming from studies intended to 
evaluate the notions that retardates are selectively impaired in their 
ability to conceptualize themselves as failing or not failing on a task or 
that retardates more than normals tend to reward themselves verbally 
rather than responding to actual rewards. 

The question arising rather directly from consideration of the basic 
ideas of the social learning approach is that of whether the retarded 
individual has a lower general expectation for success and a higher 
generalized avoidance tendency than the otherwise comparable 
normal. If the answer to the first part of the question is affirmative and 
groups of normal and retarded subjects are matched for initial per- 
formance on a task on which correct performance is to be rewarded, 
then the retarded groups should improve relatively more than the
normals over subsequent trials. The retarded subjects, beginning with a
lower expectation of reward, would gain more from the experiences
with the actual rewards obtained. Conversely, if training is given with 
punishment for errors, then retarded individuals should be affected 
less than normals since they initially have a higher expectation of 
failure. 
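
The logic of this prediction can be illustrated with a minimal sketch.
It assumes a linear-operator rule for updating expectancy, a form
borrowed from statistical learning theory purely for illustration; the
social learning approach does not itself specify any update rule, and the
names update_expectancy and gain_after_rewards, the rate parameter, and
the numerical values are all hypothetical.

```python
def update_expectancy(expectancy: float, rewarded: bool,
                      rate: float = 0.2) -> float:
    """Move expectancy a fixed fraction of the way toward the outcome.

    Assumed linear-operator rule: the target is 1.0 after a reward
    and 0.0 after a failure.
    """
    target = 1.0 if rewarded else 0.0
    return expectancy + rate * (target - expectancy)


def gain_after_rewards(initial: float, n_rewards: int,
                       rate: float = 0.2) -> float:
    """Total rise in expectancy after n_rewards consecutive rewards."""
    expectancy = initial
    for _ in range(n_rewards):
        expectancy = update_expectancy(expectancy, True, rate)
    return expectancy - initial


# Same update rate, different starting expectancies (illustrative values).
low_start = gain_after_rewards(0.2, n_rewards=5)
high_start = gain_after_rewards(0.6, n_rewards=5)
print(low_start > high_start)
```

With equal rates the group starting from the lower expectancy gains more,
which is the direction of the prediction. If the rate is itself assumed
to be smaller for that group (for instance rate=0.05), the ordering of
the gains can reverse, so the prediction depends on that further
assumption.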

It must be noted that, as in other applications of the social learning
approach, the predictions are by no means unambiguous. If the levels
of general expectation at the beginning of a particular experiment
differ for retardates and normals because the former are slower in
building up such expectations as a function of reward or punishment
experiences, their expectations might then be affected less than those
of the normals by the rewards received in the particular experimental
situation. In any event, Heber (1957) attempted to test these
predictions in a simple reaction time experiment. The results were
confirmatory for the analysis with respect to data from success trials,
the reaction times of the retarded subjects decreasing more rapidly over
trials than those of normals. However, an unexpected trend appeared for
the failure trials, on which the reaction time for the normals went much
below that for the retardates and then regressed somewhat late in the
series. The latter finding, confirmed by Gardner (1966) in a
card-sorting task, suggests that perhaps the normal individuals have
learned to respond to initial failure in a task by increased effort
whereas the institutionalized retardates have learned to respond to
failure by avoiding the task situation so far as possible.

Other experiments attempting to bring out consistent differences
between normals and retardates with respect to avoidance tendencies
have yielded rather inconsistent results and led to the formulation of
what Cromwell (1963) terms a developmental hypothesis. The gist of
the latter is that the very young child reacts only to punishments and
rewards per se; that is, he is influenced simply by the extent to which
they affect his immediate drives and motives, lead to pleasure or pain.
With age the child learns the relationships between his own behavior
and consequences and comes to realize the degree to which he can
control the occurrence of reward or punishment and therefore the
experience of success or failure. As a consequence the older child
develops motivation to engage in activities that permit him to
demonstrate his ability to succeed and learns to avoid situations where
his feeling of competence may be threatened.

A study by Bialer (1961) attempted to obtain confirmation of the de- 
velopmental hypothesis by studying normals and retardates of various 
ages in two task situations. The first was a repetition-choice situation.
The subject was given two puzzles to complete, and was allowed to
complete one (being informed of his success) but was interrupted in 
the course of the other (this being described to him as a failure). The 
subject was then given the choice of returning to the failed task or to 
the completed one. The other experimental task was a choice situation 
permitting the subject to choose between a smaller immediate and a 
larger delayed reward. Bialer found that higher CAs in both normal and 
retarded children and to an even greater extent higher MAs were asso- 
ciated with a greater tendency to return to the incomplete (thus ini- 
tially failed) task and to choose the larger delayed reward over the 
smaller immediate reward. These findings were taken to support the 
notion that the individual of higher MA is less dependent upon the 
immediate effects of rewards and punishments per se. Rather, he tends 
to characterize these events to himself in terms of success and failure 
and is influenced by them in accord with his relatively generalized and 
persistent habits of responding to successes and failures. 

Continuations of this line of research by Miller (1962) and others 
were principally concerned with comparisons of groups which differed 
in a variable termed “locus of control.” This concept refers to the
tendency to perceive success and failure as results of external agen- 
cies as opposed to reflections of the adequacy or inadequacy of the in- 
dividual’s own efforts. Relative standing on this variable was evaluated 
by means of a paper-and-pencil task developed by Bialer and Cromwell 
(1960). 

For groups of normal children, Miller (1961), using a serial-learning 
situation, found no differences between groups tending toward external 
locus of control (ELC) and groups tending toward internal locus of con- 
trol (ILC) under conditions of success emphasis, that is, learning with 
rewards for correct responses but with no punishment. However, a 
large advantage for the ILC group appeared under failure emphasis, 
that is, learning with penalties for errors but no rewards, and under 
neutral conditions with neither rewards nor punishments. A shift from 
failure to success emphasis produced much the greatest facilitative 
effect on the ELC groups. Thus, this study seemed to show that it is 
possible to characterize subjects by an individual-difference variable 
which has considerable predictive value with respect to the individu- 
al’s dispositions to respond favorably or unfavorably to reward or pun- 
ishment, success or failure. The major questions of how locus of con- 
trol develops in the child and whether differences in this variable are 
of great import for the study of mental retardation have not yet been
directly attacked.


3. a tentative formalization


The conclusions from the various studies conducted within the gen- 
eral framework of the social learning approach have led Cromwell to a 
set of postulates and theorems which he believed might be useful in 
extending this area of research. Examples of the postulates follow: 

I. The initial system of motivation of the human organism is to avoid noxious
events and to approach satisfying events. . . .

II. The cumulative associations of the outcomes of goal-approach and
threat-avoidance events with one’s own behavior evolve and become
conceptualized over a period of time into a system of motivation to
demonstrate one's own behavioral effectiveness. . . .

III. Independent of developmental level, the magnitude of the earlier system
versus the later system determines the relative sensitivity (responsivity) of
the individual to cues of hedonistically positive or negative circumstances
versus cues relating to success and failure (degree of behavioral
effectiveness) in the individual, respectively (Cromwell, 1963, pp. 78-80).


Examples of “theorems” follow: 


I. Verbal responses expressing an internal locus of control increase as a
function of age.

II. The tendency to choose success (behavioral effectiveness) under hedonistically
negative circumstances increases as a function of age.

III. The tendency to avoid failure under positive hedonistic circumstances will
increase as a function of age.

IV. To the extent that mental age is a measure of the conceptual development of
the individual, its magnitude will be more related to the locus of control and
other success-approach tendencies than chronological age (Cromwell, 1963,
pp. 79-80).

Much as in the case of Rotter’s (1954) earlier presentation of “social
learning theory,” it does not seem sensible to this reviewer to evaluate
Cromwell's collection of postulates as though it were directed
primarily toward formal theory construction. The postulates are not
securely grounded in empirical results, but rather represent trends that
Cromwell and his associates believe to be discernible in a mass of
frequently conflicting findings. The postulates contain many undefined
terms and the theorems are simply suggested implications and are not
logically derived from the postulates. The spirit in which Cromwell
seems to have put forth the set of postulates is rather to characterize
the degree of progress toward theory construction in this area and to
aid in the direction of continued research.


4. comment 


Although it is a sociological fact that investigators in various areas of 
psychology tend strongly to develop their own disciplines in relative
isolation from others, one cannot help being struck by the possibility
that the kinds of research most needed in further exploitation of the
social learning approach might be greatly furthered if there were more
utilization of progress in related disciplines.² For example, during the
past decade, experimental and theoretical investigations of learning 
and motivation at both the behavioral and physiological levels have 
yielded major advances in understanding some of the mechanisms and 
processes involved in the control of behavior by reward and punish- 
ment. Surely progress toward understanding the development of social 
learning would be accelerated by more effective utilization of this body
of relevant information and theory.

The investigation of personality development in the mentally defec- 
tive, as well as in normals, must be a major long-term research goal. 
Nonetheless it is hard to avoid the conclusion that more methodolog- 
ical groundwork is needed before one can realistically expect this area 
to begin yielding constructive results comparable to those already 
demonstrable in the studies of more elementary learning and cognitive 
processes. None of the studies covered in Cromwell’s extensive review 
has reported any large effects upon the performances of mentally 
defective individuals from the manipulation of success or failure expe- 
riences or the other variables that are taken to be important in the so- 
cial learning approach. 

On the positive side, the indications that personality variables which 
can be independently scaled may predict an individual’s responsive- 
ness to such major determiners of performance as reward and punish- 
ment suggest the possibility of important long-term benefits from con- 
tinued exploitation of this approach. It seems that the major potential 
contribution of the approach must be in locating and studying the ex- 
periences during the earlier development of the child which lead to the 
establishment of motives for achievement, reward values, and
responsivity to success and failure.


² Some substantial progress in this direction is reported in a theoretical essay by
Gewirtz (1969) which appeared after the present volume was in press.


XI. Field Theory


Although numerous studies of performance and learning in the
retarded have been conducted under the influence of the viewpoint
associated with “field theories” in psychology, it remains difficult to
elucidate just what field theory is or how it is applied to the direction of
research. In general, the view of learning within gestalt theories has
been that an individual's performance is determined largely by the way
in which he perceives his environment, and that perception is in turn
determined by organizational principles having to do with goodness of
figure, perceptual closure, relations between figure and ground, and
the like. Memory is essentially a residue of perceptual experiences,
and memory traces undergo changes in the course of time in accord
with similar principles of organization.

Early interpretations of mental development in terms of gestalt
principles are largely associated with Lewin and his students. Lewin
(1935) and Kounin (1941) elaborated a conception of progressive
differentiation of an individual’s perceptual and cognitive activity
with age and interpreted mental deficiency as retardation in this
differentiation process with the consequent “rigidity” of cognitive and
perceptual processes. The concept of rigidity seems not to have been
fully spelled out operationally, but has to do with tendencies to react
to problem situations by falling back on perseverative or stereotyped
behaviors.

The idea that the maladaptive behavior characteristic of the
mentally deficient is a direct result of retardation in a developmental
or maturation process has never found appreciable support in
experimental evidence, and has been extensively criticized by Zigler
(1962), who interprets the data seeming to support this conception as
reflecting complex motivational factors. Zigler proposes that
institutionalized retardates have higher generalized avoidance
tendencies toward adults than normal individuals and at the same time
higher desires for social approval, of which they are characteristically
deprived. Zigler suggests that these strong sources of motivation
conflict with or override those conducive to effective learning and
problem solution and lead to rigid or stereotyped behaviors.

Neither approach to the loosely defined collection of symptoms of
intellectual malfunctioning subsumed under the term “rigidity” has 
given rise to a great deal of fruitful research. The developmental con- 
ception is almost immune to investigation until developed in much 
more detail, and even then would have to derive support largely from 
the exclusion of alternative interpretations. The importance of motiva- 
tional factors in the performance of the retarded is probably difficult to 
overestimate. With regard to the motivational interpretation, it seems 
all too easy to demonstrate characteristic differences in motivational 
patterns between people living normal lives and the mentally deficient
living in institutions or in other severely limited environments but almost
insuperably difficult to begin uncovering the role that motivational
factors may have in the etiology of mental deficiency. As will be seen in a
later section, the presently more feasible task of finding ways of mod- 
ifying motivational conditions in the direction of facilitating adaptive 
behaviors and improving intellectual functioning has been undertaken 
largely within the operant conditioning approach. 

In a systematic review of research on retardation that has been con- 
ducted under the influence of a field theory, Spitz (1963) finds some 
support for this approach in laboratory studies conducted to examine 
developmental trends in such processes as perception of illusions and 
reversible figures. The single concept from gestalt theory most promi- 
nent in this line of research is that of cortical satiation, originally de- 
veloped by Köhler and Wallach (1944) to denote the buildup of cortical 
inhibition consequent upon continued stimulation. In a number of 
studies, Spitz and his associates have found support for their hypothesis
that mentally retarded individuals are slower than normals in both the 
buildup and decay of cortical satiation as reflected in, for example, 
speed of locating hidden figures, rate of reversing reversible figures, 
and optimal intervals for successive masking in tachistoscopic percep- 
tion (Spitz & Blackman, 1959; Spitz, 1959; Spitz, 1969). 

On the basis of these studies Spitz (1963) has proposed the following
four “postulates” to characterize mental retardation within the
framework of field theory:

1. Changes of state of groups of cortical cells in response to
stimulation are slower for retardates than for normals.

2. Following offset of stimulation the return of cortical cellular
activity to the normal state is slower for retardates than for normals.

3. In the retardate it is more difficult to change once established
cortical patterns into new patterns as a result of changes in stimulation.

4. In the retarded there is less spread of activity from stimulated
cortical cells to the surrounding cortical field.

The immediate implications of these postulates are presumed to be:
(a) the mentally retarded individual should learn less in a given
number of trials than the normal individual; (b) the retarded should
forget relatively less of material once learned; (c) the retarded should
exhibit poorer transfer of learning to new situations; (d) the retarded
should exhibit higher thresholds for such phenomena as apparent
movement.

It is not difficult to see that these postulates represent a somewhat
plausible translation of a number of commonly observed experimental
findings into terms having to do with cortical cells and the like.
However, unless much more explicit rules can be provided relating the
assumed cortical phenomena to behavior, it is difficult to see that this
type of formulation could be expected to serve as more than an
heuristic aid in research for individuals who find it intuitively
appealing.

In a later review, Spitz (1966) goes rather further toward
reformulating some of the ideas drawn from gestalt psychology in terms
compatible with those of other learning theories and more nearly
translatable into empirically testable propositions. A new series of
investigations starts from the gestalt view that memory is aided by the
isolation of material from context and by organization into meaningful,
cohesive units in accord with much the same principles that govern
organizational aspects of perception. Within this framework one is led
to inquire concerning the extent to which mental retardation may be
due to the inability of retarded individuals to organize material in
memory in ways which resist decay or distortion with the passage of
time.

A convenient experimental situation for preliminary analyses was
provided by the standard free verbal recall experiment. Studies by
Bousfield (1953) and Tulving (1962), among others, have shown that in
recalling lists of objects or words, normal adults tend to recall the
items in groups or clusters associated with meaningful categories, that
the extent of such clustering tends to increase with practice in free
recall, and that there is a positive correlation between the extent of
clustering and the amount recalled. These findings led to the
hypothesis (Miller, 1956; Spitz, 1966) that repetition may be important
primarily in providing the individual with an opportunity to group and
otherwise organize material in memory.

Early studies of free recall in relation to IQ, using pictures or
familiar words as stimuli, revealed little obvious variation in the
amount of recall or amount of clustering over a considerable range of
IQs. Later work, mainly using verbal stimuli, established that educable
retardates both recall and cluster less than normals of equal CA. When
MA is equated, differences in recall disappear; however, the retarded
groups tend to exhibit less clustering if the measures of clustering are
corrected for “categorical intrusions.” This last term refers to the
tendency to make errors of commission on recall tests by giving words
which were not in the lists presented but belong to the same
meaningful categories as words which were in the list.
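
The clustering measure and its correction for categorical intrusions can be made concrete with a short sketch. The Python below is illustrative only. It assumes a simple adjacency count (the number of successive recalled items that share a category), which is one way of scoring clustering but not necessarily the index used in the studies cited. The word lists, the lexicon of extra-list words, and the function names are all hypothetical.

```python
# Hypothetical study list: each presented word mapped to its category.
LIST_ITEMS = {
    "dog": "animal", "cat": "animal", "horse": "animal",
    "apple": "fruit", "pear": "fruit", "plum": "fruit",
}
# Hypothetical lexicon for words that were NOT presented.
LEXICON = {"cow": "animal", "chair": "furniture"}


def category_repetitions(recalled, categories):
    """Count successive recalled pairs that share a category.

    Words missing from `categories` are dropped before pairing, so the
    words on either side of a dropped word are treated as neighbours.
    """
    known = [w for w in recalled if w in categories]
    return sum(categories[a] == categories[b] for a, b in zip(known, known[1:]))


def categorical_intrusions(recalled, list_items, lexicon):
    """Recalled words that were not presented but fit a presented category."""
    presented = set(list_items.values())
    return [w for w in recalled
            if w not in list_items and lexicon.get(w) in presented]


recall = ["dog", "cat", "apple", "pear", "cow", "horse"]
intrusions = categorical_intrusions(recall, LIST_ITEMS, LEXICON)
# Uncorrected: intrusions are scored as if they were category members.
uncorrected = category_repetitions(recall, {**LIST_ITEMS, **LEXICON})
# Corrected: only words actually presented enter the count.
corrected = category_repetitions(recall, LIST_ITEMS)
print(intrusions, uncorrected, corrected)
```

In this made-up protocol the intrusion "cow" pairs with "horse" and so raises the uncorrected count; scoring only the presented words removes that pair.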

Spitz was led to a particular set of studies by the observation that
college students’ superior recall performance is accompanied by very
high tendencies to group and to cluster items in recall as compared to
normal children or retarded individuals of similar CA. On the
assumption of a causal relation between clustering and recall, it seemed
possible that recall in retardates might be substantially improved
experimentally by measures leading them to do more grouping or
clustering. Two procedures suggested by this hypothesis, one involving
presenting items grouped in meaningful categories and the other guiding
the subject’s recall by requesting him to give items by categories,
both led to very large increases in recall performance on the part of
children with IQs of 60-70.

With respect to paired-associate learning, recall strategies are not as
obvious, but the interaction of meaningfulness (or difficulty) of material
and IQ as predictors of performance suggests that the retarded
individuals are at a disadvantage in producing mnemonic devices to
facilitate recall. Following out this idea, Jensen and Rohwer (1963a, b),
working with adults of IQ of 50-60, produced substantial increases in
rate of acquisition by instructing the subjects to form sentences
relating the stimulus and response members of paired-associate items.
And Jensen (1965) found that retardates given specific instructions to
use this type of mnemonic device performed better than normals of equal
age who had learned without the aid of special instructions.


Even memory span for digits, a subtest in many intelligence scales,
has proven susceptible to the same type of manipulation. Spitz (1966)
found that the recall of digits by retardates was substantially improved
by instructions to group the digits (a technique evidently generally
used spontaneously by normals).


On the basis of the encouraging results obtained with even small
amounts of instruction in the use of mnemonic devices and
organizational strategies, Spitz has arrived at the suggestion, to my
mind a most promising one, that long-term studies need to be undertaken
in which retarded individuals are given sustained training of this kind
in order to ascertain whether substantial improvements in their
capabilities for intellectual functioning might be effected.


XII. The Operant Conditioning Approach 


Though not a theory in the technical sense, the body of doctrine
associated with operant conditioning procedures and concepts has become
extremely influential in guiding a major segment of research upon
animal performance in relation to learning and motivation and the
extension of the experimental results to the practical control of
behavior. This system, originally formulated by Skinner (1938), falls in
the stimulus-response-associationistic tradition and differs from that
of Hull mainly in placing less emphasis upon the inference of
underlying states or processes from performance and more emphasis upon
the teasing out of empirical relationships between observed behaviors
and controlling variables.


1. Concepts and Principles

The principal systematic independent variable of the system is rein- 
forcement, a reinforcement being any consequence of a response which 
leads to an increase in probability of that response upon future recur- 
rences of the same situation. Reinforcers are defined simply in terms 
of their effects upon behavior, with no attempt being made to seek in- 
dependent definitions in terms of physiological mechanisms such as 
need reduction or the activity of brain centers. It is assumed that for 
any organism there exist events having the property of acting as rein- 
forcers (else the organism would not survive). A reinforcer is expected 
to exert similar effects upon different responses in the same situation 
and similar effects upon the same response in different situations. 
Thus the concept of reinforcement is by no means circular. Once a 
reinforcing operation is identified for a given organism, the system
specifies how it can be used in the future to modify behavior. The aspect
of behavior altered by reinforcement is said to be response strength,
or simply likelihood of responding, this property being most directly
manifest in the rate of responding in any situation where the organism 
is not paced by external events. 

Within this system one does not speak of the acquisition of knowledge,
nor even of associations, but simply of modification in behavioral
tendencies as a result of reinforcement history. Skilled performances
of various sorts are assumed to develop by way of discrimination and
differentiation. If a response is originally made indiscriminately to
different stimuli present in an experimental situation, as to black and
white cards in a simple discrimination problem, the response is
brought under the control of the stimulus property by differential
reinforcement. For example, one may reinforce the organism by presenting
food whenever it makes a given response in the presence of the black
stimulus and withholding reinforcement when the organism makes the
response in the presence of the white stimulus. If the organism
originally makes the response, say operation of a response key, in a
variety of ways, and one desires to train the organism to make the
response always with a particular force, the procedure is to make
reinforcement contingent upon responses having forces falling within a
specified range and to withhold reinforcement whenever responses occur
with forces outside the specified range. Always a critical concept is
contingency: reinforcements must be made contingent upon responses of a
specified character occurring only in the presence of specified stimuli
or stimulus properties.

Auxiliary principles of major import for training are reinforcement
schedule and shaping. A schedule consists in a set of rules specifying
the precise conditions under which an organism will receive
reinforcement for a given response. A vast variety of schedules has been
studied in detail for several simple operant conditioning situations, in
particular bar pressing by the rat (Skinner, 1938) and key pecking by
pigeons (Skinner, 195(; Ferster & Skinner, 1957). If the experimenter's
or trainer's objective is to build a habit that will persist over a
considerable period even when reinforcement may not be available, the
training schedule is arranged so that the animal frequently has
experiences of responding without immediate reinforcement and has an
opportunity to learn that if it persists in responding under such
circumstances further reinforcements may be forthcoming. In more
systematic terms, the response under investigation becomes associated
with stimuli occurring on unreinforced occasions as well as with stimuli
present only on or following occurrences of reinforced responses.
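
The notions of contingency and schedule lend themselves to a small sketch. The Python below is an illustration under stated assumptions, not a description of any apparatus used in the work cited: the force range, the stimulus labels, and the names `contingency_met` and `every_nth` are hypothetical. A rule that reinforces every n-th qualifying response is the kind usually called a fixed-ratio schedule in the operant literature.

```python
from dataclasses import dataclass


@dataclass
class Response:
    stimulus: str   # stimulus present when the response occurred
    force: float    # response force, arbitrary units


def contingency_met(resp, stimulus="black", force_range=(20.0, 30.0)):
    """True if the response occurred to the specified stimulus with a
    force inside the specified range (both values are hypothetical)."""
    low, high = force_range
    return resp.stimulus == stimulus and low <= resp.force <= high


def every_nth(n):
    """Schedule: deliver reinforcement on every n-th qualifying response."""
    count = 0

    def schedule(resp):
        nonlocal count
        if not contingency_met(resp):
            return False          # non-qualifying responses are never reinforced
        count += 1
        if count == n:
            count = 0
            return True
        return False

    return schedule


schedule = every_nth(3)
session = [Response("black", 25.0)] * 4 + [
    Response("white", 25.0),   # wrong stimulus
    Response("black", 45.0),   # force outside the range
    Response("black", 22.0),
    Response("black", 28.0),
]
outcomes = [schedule(r) for r in session]
print(outcomes)
```

With n greater than 1, most qualifying responses go unreinforced, which is the condition the text describes as giving the animal experience of responding without immediate reinforcement.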

Shaping behavior by successive approximations is a procedure that
has evolved for training a response in a given situation even though the
organism originally has no tendency to make the given response, so
that under simple trial-and-error routines the response would never be
rewarded and thus never strengthened. The idea is to observe the
organism's behavior in the given situation and determine the class of
responses which it does emit that resemble in some way the one which is
to be trained. For example, if a pigeon is to be trained to peck a key, it
may originally exhibit no tendency to do so but may nonetheless make
movements in the direction of the key and, on other occasions, may
make pecking movements. The training program then utilizes a
changing series of reinforcement contingencies in which the animal is
first reinforced whenever it moves, then is reinforced whenever it
either makes a pecking movement or moves in the direction of the key,
and finally is reinforced only when it actually pecks the key.
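
The changing series of contingencies in the pigeon example can be written out as a sketch. The Python below is only illustrative: the behavior flags, the `criterion` for advancing from one contingency to the next, and the function names are assumptions introduced here, since the text does not say how a trainer decides when to tighten the requirement.

```python
# Successive reinforcement contingencies from the pigeon example.
STAGES = [
    lambda b: b["moved"],                        # any movement
    lambda b: b["pecked"] or b["toward_key"],    # peck, or approach to the key
    lambda b: b["pecked_key"],                   # actual key peck only
]


def observe(moved=False, toward_key=False, pecked=False, pecked_key=False):
    """Bundle one observed behavior into a dict of flags (hypothetical coding)."""
    return {"moved": moved, "toward_key": toward_key,
            "pecked": pecked, "pecked_key": pecked_key}


def run_shaping(observations, criterion=3):
    """Apply the current contingency to each observation and move to the
    next contingency after `criterion` reinforced responses in a row
    (an assumed advancement rule)."""
    stage, streak, log = 0, 0, []
    for obs in observations:
        reinforced = STAGES[stage](obs)
        log.append((stage, reinforced))
        streak = streak + 1 if reinforced else 0
        if streak >= criterion and stage < len(STAGES) - 1:
            stage, streak = stage + 1, 0
    return log


log = run_shaping([observe(moved=True)] * 4
                  + [observe(moved=True, toward_key=True)])
print(log)
```

Once the requirement has been tightened, a movement that earned reinforcement at the first stage no longer does so unless it is directed toward the key.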

Because of the emphasis upon the actual modification of behavior as
opposed to the inference of underlying processes, the operant
conditioning approach would seem to be a natural framework for at least
some types of research on the training of the mentally retarded.
Indeed, substantial research programs have been under way for a
number of years exploring this motif (e.g., Bensberg, 1965; Bijou, 1966;
Bijou & Baer, 1967; Sidman & Stoddard, 1966).


2. Approach to Mental Retardation

The general viewpoint associated with this approach to problems of 
mental retardation has been presented by Bijou (1966). Bijou points up 
rather sharply the disparity between the rapid advances in concepts 
and methods in various areas of psychology and the rather meager re- 
turns to date from psychological research on problems of mental retar- 
dation. Bijou feels that the main cause of the slow rate of progress is 
the traditional view of mental deficiency as a hypothetical condition of 
the organism inferred from behavior. He proposes that instead retarded
development be viewed as principally a function of inadequate
reinforcement and discriminative histories. Inadequate performance,
whether in everyday life situations, intelligence tests, or school
learning situations, should be taken simply as evidence that the
appropriate behaviors have not been selectively strengthened by
differential reinforcement in the past.

Analyzing the situations to which the retarded child is typically
exposed, Bijou points out that a major cause of inadequate reinforcement
histories may be the meager resources for selective reinforcement in
institutions or foster homes. Further, the tendency to treat the retarded
child as chronically ill, restricting his exploratory behaviors and not
making reinforcements contingent upon differential behaviors on the
child's part, might be expected to lead to a grossly restricted repertoire
of discriminative behaviors.

The fact that the retarded child is often a behavior problem is likely
to lead to considerable use of punishment on the part of those in
charge of the child, and it is well known that a reinforcement schedule
depending largely upon punishment rather than reward is not conducive
to building up repertoires of differential behaviors. Also, the effects
of punishment are likely to generate side effects that would not be
anticipated by the layman. For example, punishing a child for socially
undesirable verbal behavior without at the same time providing
conditions suitable for differential reward of a more acceptable
behavior may lead the child to avoid punishment by such artifices as
speaking unintelligibly. Further, the situations in which punishment
occurs and the people involved become conditioned stimuli for emotional
responses and lead to avoidance, thus setting up conditions hardly
conducive to constructive learning. Finally, unintentional reinforcement
contingencies may lead to the specific shaping of undesirable
behaviors. If, for example, the retardate has few behaviors available
which obtain attention from adults, he may find by trial and error that
this end is achieved only by socially undesirable acts, and thus these
would be unintentionally strengthened.

It must be recognized that the role of inadequate reinforcement
histories in the causation of retarded intellectual functioning is
documented by very little specific evidence. Indeed it is scarcely to be
expected that Bijou's analysis can ever be fully verified at the human
level by controlled experimentation. Though it is often easy to find
evidence of extremely unfavorable conditions for learning in the
background of the retarded child, it is never possible to determine
whether the same inadequate learning condition in a child with different
genetic background would have led to the same final intellectual
performance. I suspect that the answer of the investigators in the
operant-conditioning tradition would be that such comparisons are not
essential. Proceeding on the premise that the analysis of retardation as
summarized by Bijou is essentially correct, they would proceed to
attempt to overcome specific varieties of retarded intellectual
functioning by specific reinforcement procedures. Success in these
efforts would certainly be taken as support for the theoretical
analysis. However, the converse does not hold: failures to overcome
these deficits in intellectual performance by training procedures would
not justify the conclusion that the deficits are not attributable at
least in part to unfavorable learning conditions in the past.

A considerable body of research on effects of early experience upon 
behavior in both animals and human beings provides some support for 
Bijou’s analysis of the causes of mental retardation. A thorough review 
of this literature and its implications for problems of mental retarda- 
tion has recently been presented by Haywood and Tapp (1966) and 
need not be duplicated here. It is clear that deficiencies in sensory 
input and in opportunities for varied learning experiences in early life 
may produce irreversible changes in adult performance in learning sit- 
uations as well as emotional and motivational disturbances. 

Further, studies providing comparisons of effects of different envi- 
ronments upon mental development of children leave little doubt that 
continued residence in custodial institutions where the children lack 
normal opportunities for learning experiences involving the continuing 
interaction with particular adults normally characteristic of a family 
often leads to a progressive decrease in measured intelligence and also 
to retardation in development of motor functioning and even physical 
growth. Some broad-scale studies (e.g., Gray & Klaus, 1964) have, by 
administering intensive programs of varied learning experience over 
periods of months to culturally deprived children, produced appreci- 
able gains in measured intelligence, as compared to controls, and im- 
proved school learning performance. Thus, regardless of whether the 
analysis of retardation in terms of conditioning history can ever be 
supported in an extreme form, there is a tangible basis for anticipating 
major benefits from the kind of detailed analysis of deficient performance
and individual programming of retraining procedures that are
characteristic of the operant conditioning approach. 

An excellent illustration of the operant conditioning approach in
action in the context of mental retardation research has been presented
by Sidman and Stoddard (1966). These investigators set themselves the
task of determining whether severely retarded, institutionalized
individuals could discriminate among simple geometric forms,
specifically, in the experiments to be discussed, whether they could
distinguish circles from ellipses. The type of individual with whom they
proposed to establish communication and to make these determinations is
typified by their first subject, a 40-year-old microcephalic idiot with
an IQ below 20, with an active vocabulary limited to three words which
he did not regularly use appropriately, and with the ability to respond
only to a few simple commands such as “come here.” Preliminary tests
with methods of the sort ordinarily used to assess sensory capacities
had suggested that this individual could not distinguish a circle from
an ellipse, but a possibility remained that this failure was due to lack
of communication between the patient and the examiners.

Within the theoretical framework in which Sidman and Stoddard
were working, it would be concluded that the individual could
distinguish a circle from an ellipse if he could be trained to make
differential responses to stimuli of these two classes over a series of
trials in which extraneous bases for differential response, such as
positions of the stimuli, were eliminated. To arrive at this stage it
would be necessary to shape the individual’s behavior so that usable
indicator responses would be emitted under the control of a
reinforcement schedule, to arrange that the individual should be
effectively stimulated by the two types of discriminative stimuli, and
then to develop the desired contingency of responses on stimulus
properties.

The first step in the training program for each subject was to find
suitable reinforcing agents, that is, rewards. Even for the most severely
retarded patients studied, this evidently was not difficult, with such
measures as opportunities to play with young and enthusiastic
assistants, the use of novel stimuli such as chimes, and the
administration of novel food rewards such as small candies. Before
behavior can be shaped in a particular direction, it must be emitted in
considerable variety in the given situation. Thus the next step in
training was always to permit the individual to habituate his original
responses to the experimental situation, such as exploration, and then
to strengthen approach behavior toward the situation by means of a
variety of rewards. Then the learner was led by such devices as
gestures, demonstration, and manual guidance to operate a simple
manipulandum which operated a reward dispenser. The subject very quickly
learned an easy discrimination. A panel of keys was presented to him,
only one of which was lighted on any trial; operation of the lighted key
led to reward and the dark keys to nonreward. This step could be
accomplished almost without the occurrence of errors.

The remainder of the training program, by far the most difficult, was
to find a sequence of steps whereby the dark keys could be changed
gradually to lighted keys containing ellipses in such a way that the
subject would continue responding only to the original lighted key which
contained a circle. This differentiation procedure was accomplished by
drawing very elongated ellipses on the keys and then very gradually
over a series of trials increasing the illumination behind the dark
keys so that the ellipses gradually became visible. The end result of
this training regimen was a situation in which the subject responded to
the lighted key with a circle, regardless of its position, even when
surrounded with other lighted keys containing ellipses. Thus the
investigators were able to conclude that the severely retarded patient’s
original apparent inability to distinguish circles from ellipses was
due, not to a lack of sensory capacity, but to an inability to
communicate appropriately with the examiners. More technically, in terms
of operant conditioning theory, the patient originally differed from
normal individuals in having no already established differential
responses to the stimulus classes involved in the test; thus
differential responses had to be established by suitable training before
his discriminative capacity could be revealed.

Sidman and Stoddard present a series of extremely instructive
accounts of their experiences in developing suitable training programs
for a number of severely retarded individuals and also for very young
normal children. Throughout, the emphasis is upon the prevention of
errors, for one of their most ubiquitous observations is that the
occurrence of errors tends to be highly disruptive, perhaps more so with
retarded than with normal learners. Whenever in the sequence of steps
during which an originally very easy discrimination is being
transformed into the final more difficult one errors begin to occur, the
result is nearly always a severe disruption of performance. Then the
investigators must go back to a much earlier stage in the program and
bring the learner up to the point of difficulty again with suitable
revision, since continued training on the same stage, even with
differential reward, generally does not get the learner past the point
of difficulty.

Of particular theoretical interest is a major revision of a training
program that these investigators were led to by an observation that
might perhaps have been anticipated on the basis of discrimination
theory (see Part A). A number of subjects had made rapid progress in
the early stages of the program but broke down before reaching the
final discrimination, and it occurred to the investigators that the
concurrent changes in brightness of background of the incorrect keys and
emergence of the ellipses on those keys might have been responsible.
Over a series of trials, as the fading process proceeded, the ellipses
became visible on the incorrect keys, but since the background
brightness on those keys still differed considerably from that of the
correct one, the subjects could continue to use the brightness cue even
after the ellipse was visible. In terms of Zeaman and House's theory,
the subjects were continuing to receive reinforcement for attending to
an ultimately irrelevant dimension and thus were failing to learn to
attend to the relevant dimension. When the brightnesses of correct and
incorrect keys ultimately became equal, the subjects who had learned not
to attend to the forms were, of course, unable to differentiate.

The remedy was a revised program in which the fading in of background
brightness on the incorrect keys and the emergence of the ellipses were
separated. Starting with a single correct key which was bright and
contained a circle, positioned randomly among eight keys the remainder
of which were all dark, the brightness of incorrect keys was gradually
increased until it equaled that of the correct key, but with no forms
present on the incorrect ones. Thus at the end of this stage the
learner had to differentiate correct from incorrect keys by the presence
of the circle on the former and thus was being rewarded for attending
to the relevant stimulus dimension. After this stage had been
successfully accomplished, faint ellipses began to appear on the
incorrect keys and these were gradually filled in and darkened until the
learner finally responded correctly to the lighted key containing the
circle and avoided the lighted keys containing ellipses.
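
The difference between the original and the revised fading programs can be sketched as two sequences of stimulus settings. The Python below is a simplified illustration. It assumes that each step of a program can be summarized by two numbers between 0 and 1 (background brightness of the incorrect keys, and visibility of the ellipses on them), with the correct key's brightness fixed at 1. The step counts and function names are hypothetical and are not taken from Sidman and Stoddard's report.

```python
def original_program(steps):
    """Brightness of incorrect keys and ellipse visibility rise together."""
    return [(i / steps, i / steps) for i in range(steps + 1)]


def revised_program(steps):
    """Stage 1 raises brightness with no forms on the incorrect keys;
    stage 2 then fades in the ellipses at full brightness."""
    stage1 = [(i / steps, 0.0) for i in range(steps + 1)]
    stage2 = [(1.0, i / steps) for i in range(1, steps + 1)]
    return stage1 + stage2


def brightness_cue_outlasts_ellipse_onset(program):
    """True if, on some step, ellipses are visible while the incorrect keys
    are still dimmer than the correct key (whose brightness is taken as 1.0)."""
    return any(brightness < 1.0 and ellipse > 0.0
               for brightness, ellipse in program)


print(brightness_cue_outlasts_ellipse_onset(original_program(4)))  # True
print(brightness_cue_outlasts_ellipse_onset(revised_program(4)))   # False
```

Under these assumptions the original sequence contains steps on which a learner could still respond correctly by brightness alone after the ellipses have appeared, whereas the revised sequence contains none.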

This study is especially significant in bringing out the importance of
patient, detailed analysis of the steps required to proceed from the
initial stage in which the learner's behavior is indiscriminate to a
final one in which it is brought under the control of particular stimuli
or stimulus dimensions, with errors at every stage in the training
program being the principal basis for diagnosis, not of incapacities on
the part of the learner, but of failures on the part of the training
program. Whenever errors appear, it is assumed by these investigators
that they represent a failure of the trainer to insure that the learner
has been effectively rewarded for response to the appropriate stimuli or
stimulus property at the preceding stage.

The long-term practical import of these demonstrations seems
impressive indeed. If the severely retarded can be trained to
discriminate circles from ellipses, there is little doubt but that they
can be trained also to distinguish and respond appropriately to a
variety of signs and symbols. Except for research purposes, it would
not, of course, be feasible to tie up a large number of highly skilled
individuals in the discrimination training of a single retardate.
However, there are two important mitigating factors. Firstly, it is
already apparent from the work on discrimination training that an
effective training program, once developed, can be used for many
different individuals with the amount of individual tailoring small
relative to the effort involved in the original development of the
program. Secondly, with the use of techniques that are rapidly being
developed (see, e.g., Suppes et al., 1968) in computer-assisted
education, it may soon be possible to implement rather large-scale
training programs with relatively modest demands on skilled manpower.


XIII. Contemporary Discrimination Learning Theories


During the period when most of the learning theories discussed in
preceding sections, and in particular those of Hull, Spence, and
Skinner, were under development, discrimination learning was for the
most part treated as subsidiary to the presumably more fundamental
processes of conditioning and extinction. The prevailing assumption was
that phenomena of discrimination could be interpreted simply by
appropriate combination of such concepts as excitation and inhibition
arising from studies of more elementary learning and conditioning.
More recently, however, the focus of research on both animal and
human levels has shifted radically, discrimination becoming perhaps
the central focus of both experimental and theoretical development. To
be sure, current theories draw heavily upon concepts that have evolved
from conditioning theory, but these concepts are augmented by ideas
coming from other sources. As we have seen in the preliminary
discussion of Part A of this volume, concepts having to do with
attention and coding combine with principles of association and
response selection in a multiple process conception of discrimination
learning.

Of the specific formulations falling in this category, that of Zeaman
and House (1963) has developed in closest interaction with research on
mental development. Thus it will be convenient for our present purposes
to begin with a rather detailed consideration of the model proposed by
Zeaman and House and some associated lines of experimental
investigation. In following sections I shall consider alternatives to
the approach of Zeaman and House which are espoused by other
investigators currently attempting to deal explicitly with problems of
the development of learning abilities and also review some current
research which seems especially relevant to these alternative
conceptions. More extensive presentations of influential multiple
process interpretations of discrimination learning are to be found in
Lovejoy (1965, 1966), Neimark and Estes (1967), and Trabasso and Bower
(1968).


1. Background and experimental context of the attentional theory of
Zeaman and House


Although it could presumably be generalized to other situations, the
theory of discrimination learning proposed by Zeaman and House (1963)
has been developed in close connection with a lengthy series of
researches in a particular type of discrimination experiment. Thus it
will be convenient to present a résumé of their theory in terms of the
associated experimental paradigm.

The apparatus has been adapted from the well-known Wisconsin General
Test Apparatus (WGTA) used by Harlow (1959b) and his associates in
their extensive studies of discrimination learning sets in monkeys. As
adapted for use with children, the apparatus takes the form of a small
desk in front of which the subject is seated. Immediately in front of
the subject is a horizontal tray on which two stimulus objects are
arranged in left-right positions, each being located above a well in
which a bit of candy, which is the standard reward in these
experiments, can be placed by the experimenter. The stimuli may be
ordinary objects such as ink bottles or paper clips (used in so-called
'junk' discriminations) or, more often, plaques on which geometrical
figures are drawn. Beyond the stimulus tray is a vertical panel
containing a one-way mirror behind which the experimenter sits during
the session. Lighting is arranged so that the experimenter can view the
subject but the subject cannot see the experimenter. At the start of an
experimental trial, the experimenter slides the stimulus tray back out
of sight behind the screen, places the reward in one of the food wells
and arranges the stimuli appropriately, then slides the tray out so
that the subject can make a choice.
Suppose, for example, that the stimuli used in the experiment are a
black circle and a black triangle and that the task is for the subject
always to choose the triangle. The position of the triangle would be
varied randomly between the left and right food wells so that, to
master the task, the subject must respond on the basis of the stimulus
property and not simply on the basis of position. Ordinarily a
correction procedure is used, meaning that if the child lifts the
correct object, he finds the reward under it at once, whereas if he
lifts the incorrect object he is permitted then to lift the correct one
and obtain the reward. To be credited with a correct response on any
trial, the subject must select the correct object as his first choice.
A task involving simply discrimination between two objects is normally
relatively easy even for severely retarded children and thus most of
the studies conducted in this situation use problems involving at least
two dimensions, one stimulus dimension being relevant and the other
irrelevant to solution. Most commonly the two dimensions are form and
color. A typical problem in this category might involve two forms, say
triangle and circle, and two colors, say red and green. Four stimuli
would be used, a red circle, a green circle, a red triangle, and a
green triangle. On each trial two stimuli would be presented, each
representing one value on each dimension. Suppose that in a particular
case color were the relevant dimension with red correct and green
incorrect. This would mean that over a series of trials the four
stimuli would appear equally often, but on each trial one of the
objects presented would be red and one green, and the reward would
always be under the red stimulus regardless of whether it was a
triangle or a circle.

Originally, Zeaman and House worked within the framework of the
Hull-Spence type of association theory, which has been reviewed above.
In terms of this kind of theory, the illustrative problem would involve
four habits, each of which might acquire some habit strength over a
series of training trials: the habits would be those relating the
approach response to circular, triangular, red, and green stimuli. The
response of approaching a red stimulus and receiving reward would occur
on every trial, and consequently the strength of this habit would grow
steadily along an exponential function to its maximum value over a
series of training trials. The response of approaching a green stimulus
would receive only delayed reward, and thus the maximum strength
attainable by this habit would be lower than that of approach to red.
The responses of approaching circular and triangular stimuli would
sometimes receive immediate reward and sometimes delayed reward
(equally often on the average) and consequently these would reach
intermediate strengths. As the strengths of these habits diverged over
training trials, the probability of choosing the correct stimulus would
steadily increase to some limit which would depend upon the combination
of motivation, reward, and stimulus differences. However, relatively
early in their series of studies, Zeaman and House made some
observations which suggested to them that this type of theory, a
single-stage model as they termed it, was inadequate to represent
discrimination learning.

In the classical literature on discrimination learning, the most common
method of following the course of learning has been simply to plot the
proportion of correct responses per trial or per block of trials for a
group of subjects over the course of training. When Zeaman and House
(1963) did this for a sizeable group of retarded children run on a
simple discrimination in their apparatus, they obtained a learning
curve of the usual form, starting near chance (50% in this case) and
rising quite regularly along a negatively accelerated course to a final
level of near 100% correct performance. When, however, they subdivided
these subjects into more homogeneous subgroups on the basis of number
of days required to meet the criterion of learning, they found that in
general the subgroup learning curves did not have this form. Except for
the most rapid learners, the subgroup typically remained virtually at
chance for a considerable number of trial blocks and then rose
relatively steeply to the final common level of nearly 100% correct
responding. Thus it became clear that the group learning curve was not
representative of the course of change in performance on the part of
individual subjects.

As a consequence of this finding, Zeaman and House were led to
reanalyze their data by means of “backward learning curves.” This
procedure, described by Hayes (1953) and by Suppes and Ginsberg (1963),
takes as the reference point for each subject, not the initial trial of
the experiment but rather the trial on which the subject meets the
criterion of correct performance (20 correct responses in a daily block
of 25 trials in the present instance). In preparing a backward curve,
the protocols for individual subjects are brought together so that
Trial 1 is the trial preceding the beginning of the criterion block for
each subject, Trial 2 the trial before that, and so on; then the mean
proportion of correct responses is computed for the trial preceding the
criterion, the second trial preceding the criterion, and so on
backward.

When this analysis was applied to the data for the group of retardates
referred to above, the backward learning curves for the various
subgroups were virtually superposed. All curves began with virtually
horizontal segments over which proportions of correct responses were at
chance, followed by a relatively sharp rise to 100% correct over about
the last 40 trials preceding attainment of criterion. Thus the fast and
slow learners were found to differ primarily in the number of trials
before their learning functions began to depart from the chance level,
and not in the slope of the final sharply rising portion of the curve
representing the relatively abrupt transition from chance to criterion
performance.
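The backward-curve procedure just described can be sketched in a few
lines of Python. This is only an illustration under stated assumptions,
not the analysis actually used by Zeaman and House: the function names,
the coding of each response as 1 (correct) or 0 (incorrect), and the
division of every protocol into consecutive daily blocks of 25 trials
are hypothetical choices made for the example.

```python
from typing import List, Optional, Sequence


def criterion_block_start(responses: Sequence[int],
                          block_size: int = 25,
                          needed: int = 20) -> Optional[int]:
    """Index of the first trial of the first block that meets criterion.

    Returns None if no complete block contains at least `needed` correct
    responses (the subject never reached criterion).
    """
    for start in range(0, len(responses) - block_size + 1, block_size):
        if sum(responses[start:start + block_size]) >= needed:
            return start
    return None


def backward_curve(protocols: Sequence[Sequence[int]],
                   depth: int,
                   block_size: int = 25,
                   needed: int = 20) -> List[Optional[float]]:
    """Mean proportion correct on backward Trials 1..depth.

    Backward Trial 1 is the trial just before the criterion block,
    backward Trial 2 the one before that, and so on. Subjects who never
    reach criterion are skipped; a position with no data is None.
    """
    sums = [0] * depth
    counts = [0] * depth
    for responses in protocols:
        start = criterion_block_start(responses, block_size, needed)
        if start is None:
            continue
        for k in range(1, depth + 1):
            index = start - k
            if index < 0:
                break  # this subject has no trial that far back
            sums[k - 1] += responses[index]
            counts[k - 1] += 1
    return [s / c if c else None for s, c in zip(sums, counts)]
```

Because every protocol is aligned on its own criterion block rather
than on the first trial, subjects with plateaus of different lengths
contribute to the same backward positions, which is what allows the
subgroup curves to be compared directly.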

Zeaman and House concluded that a single-stage discrimination 
model of the type associated with the Hull-Spence theory did not fit 
the pattern of results revealed by these analyses. They felt that, if the 
mastery of a discrimination were simply a matter of the steady diver- 
gence of strengths of correct and incorrect habits as a function of 
training trials, there would be no basis for predicting the observed 
forms of the backward learning curves, with the long initial horizontal
segments at chance followed by a steep rise to criterion, or the fact
that subgroups of subjects with different learning rates differ
primarily in the lengths of the initial plateaus.
The extent to which these analyses by Zeaman and House actually
controvert a single-stage association theory of the discrimination
process is difficult to say. There are several difficulties. Firstly,
even if a single-stage theory necessarily implies that correct response
probability should increase along a negatively accelerated growth curve
as a function of trials, little is known concerning what precise form
the backward curve should take if plotted for a group of subjects whose
individual learning curves have differing slope parameters.

The only bit of evidence at all pertinent to this question that this
reviewer has seen in the literature is an analysis presented by
Trabasso (1963) in connection with a comparison of a single-stage,
growth model with a two-stage, all-or-none model, the latter assuming
that performance remains at chance until the final abrupt rise to
criterion. In order to check out predictions from the growth model,
Trabasso ran a group of computer-simulated subjects for such a model
and plotted their mean backward learning curve together with the
predicted backward curve for the all-or-none model (which is of course
horizontal prior to criterion) and the curve for data obtained from
human subjects. His observed data fit the all-or-none prediction
closely, whereas the data for the simulated subjects learning according
to a growth model yielded an average backward curve which rose in a
negatively accelerated fashion over the trials preceding criterion.
This fragment of evidence would tend to support Zeaman and House's
reasoning. However, in fairness to adherents of the Hull-Spence system,
it should be mentioned that Spence frequently emphasized that neither
he nor Hull had worked out their theory in quantitative detail for the
case of learning under a correction procedure which has been used in
the series of studies by Zeaman and House.


In any event, the question whether the logic involved in Zeaman and
House's original theoretical orientation was airtight is largely
academic. The fact remains that their analyses of backward learning
curves suggested that discrimination learning in their situation might
represent a two-stage process, and they continued to conduct similar
analyses with cumulative results which, over a period of time, provide
rather impressive support for their position.

Backward learning functions on a color-form discrimination for groups
of retarded children of differing MAs yielded an even sharper picture
than the original analysis, with the different MA groups differing
almost solely in the length of the initial plateau. In one analysis
presented in their 1963 article, for example, groups of children with
mean MAs of 2 years, 4 months and 4 years, 6 months were compared. The
backward curve for the 4-6 group rose steeply from chance to near 100%
performance over about 40 trials. The curve of the 2-4 group differed
only in that it hovered around the chance level of 50% correct
responding with no obvious trend for about 180 trials before beginning
to rise, then like the 4-6 curve rose steeply to virtually 100%
performance over about 40 trials.

Even more suggestive with regard to theory, perhaps, is the fact that
similar patterns emerged when Zeaman and House compared performance for
similar groups of children learning problems of differing difficulty.
For example, in color-form discriminations, it is generally found, at
least with retarded subjects, that color discriminations are
considerably more difficult than form discriminations and
discriminations based on two-dimensional objects. Backward curves for
groups of children learning these various types of discriminations
(Zeaman & House, 1963, p. 165) yielded a rather striking picture. The
curves for the problems of varying difficulty differed greatly with
respect to the length of the initial plateau over which performance
remained at a chance level, but all were exceedingly similar in the
steepness of the learning curve once it departed from chance and rose
to criterion over an interval of 20-40 trials.

This finding was evidently one of the bits of evidence which suggested
to Zeaman and House that the two stages in discrimination learning
might represent an attentional process and a learning process, the
initial horizontal segment of the backward learning curve representing
a period before the subject had come to attend to the relevant stimulus
dimensions for the given problem and the final steeply rising portion
of the curve representing the process of associating the correct
instrumental response with the relevant cues once these had been
discovered.


2. A two-stage, attentional model for discrimination learning

The specific two-stage discrimination learning process to which Zeaman
and House were led by their analyses was spelled out in the following
assumptions (Zeaman and House, 1963).

1. On each trial of a discrimination experiment the stimulus display
represents a variety of stimulus dimensions. The dimensions include not
only those specifically controlled by the experimenter, such as color
and form in a color-form discrimination, but position and various other
uncontrolled dimensions such as texture, distance, and the like, which
are generally irrelevant to a discrimination problem.

2. The subject attends to only one dimension at a time.

3. The dimensional aspects of stimuli elicit attention; the result of
the subject's attending is the perception of specific cues (for
example, a circular form, the color red).

4. The specific cue which is perceived as a result of attending to a
dimension elicits the instrumental response of approaching and lifting
one of the stimulus objects.

5. Dimensions are broad classes of stimuli having common discriminative
properties. Instances of dimensions which have been found to be
important in discrimination learning by retarded children are form,
color, size, position, and brightness.

6. If the subject's choice on a discrimination learning trial is
rewarded, both the observing response and the instrumental response
which he made on the trial increase in probability of occurrence. The
term observing response refers to the behavior of attending to a
stimulus dimension.

7. If the subject's choice on a trial is not rewarded, both his
observing response and his instrumental response decrease in
probability, and observing responses which do not occur on that trial
receive complementary increases in probability. Similarly, when a
response is rewarded, observing responses which do not occur on that
trial decrease in probability.

8. In the two-alternative discriminative situation, the probability $P$
of the overt response of choosing the correct stimulus object is given
by the expression $P = ap + (1 - a)/2$, where $a$ denotes the
probability of observing the relevant dimension and $p$ the probability
of making the correct instrumental response given perception of the
stimuli of that dimension. The probability of a correct instrumental
response to an irrelevant cue is taken to be 1/2 under the usual
experimental arrangements.

To elucidate these assumptions in terms of the standard
discrimination-learning situation, let us take as an example the
familiar color-form discrimination problem with color the relevant
dimension, red being correct and green incorrect. It is assumed that at
the beginning of the experiment a subject has some probability,
generally low, of attending to the color dimension, some probability of
attending to form, which is irrelevant in this problem, and some
probability of attending to other uncontrolled and irrelevant
dimensions. Since all irrelevant variables are carefully balanced over
a group of subjects by the experimental design, we can assume further
that at the beginning of the experiment, on the average, a subject's
probability of choosing the correct object is 1/2, regardless of what
dimension he attends to.

On early trials, the subject's choice of a stimulus object will be
rewarded on about 1/2 of the trials and unrewarded on 1/2 in a random
sequence, regardless of what dimensions he attends to. The
probabilities of attending to the various dimensions will receive
increments and decrements equally frequently on the average and thus
will continue to hover around their original values. However, for each
trial on which the subject happens to attend to the relevant dimension,
color in this instance, his probability of making the correct choice
response when attending to color will be increased (either because he
makes the correct choice by chance and is rewarded or because he makes
the incorrect choice and is not rewarded). Once the probability of
making a correct choice when attending to the relevant dimension has
risen above chance, so also will the probability of being rewarded when
attending to the relevant dimension; the response of attending to color
will begin to receive increments more often than decrements in
probability and therefore will begin to rise above chance. Thus the
process will accelerate like an autocatalytic chemical reaction, each
increase in probability on the part of either the instrumental or the
attentional response leading to an increase in the likelihood that both
responses will gain in probability on subsequent trials.
According to this conception, the initial plateau of the backward
learning curve, during which performance hovers around a chance level,
represents a phase during which the subject is predominantly attending
to irrelevant dimensions and thus is making no progress whatever toward
mastery of the discrimination. When the subject finally happens to
attend to the relevant dimension, the reward he receives will produce
an increase in his probability of making the correct response when
attending to that dimension, and the accelerative phase of his learning
process will begin.

These assumptions concerning the subject's tendencies to attend to
different dimensions and to make choices once he has perceived the cues
of a given dimension are qualitatively similar in many respects to
those of other two-stage theories of discrimination learning, notably
those of Kendler and Kendler (1962) and of Sutherland (1959). Had
Zeaman and House stopped at this point in their theory construction,
their formulation would have suffered the same limitation as the others
mentioned. The earlier theories have been influential in directing
research in the hands of the originators, who have intuitive feel for
the mode of operation of their only partially specified theories, but
have been difficult for other investigators to understand fully or to
test rigorously because of ambiguity concerning just what empirically
testable propositions follow from the assumptions. However, Zeaman and
House took the additional step of presenting their theory in a
provisional quantitative form. At the cost of some simplification of
their general notions and a temporary restriction to a specific type of
experimental application, they made it possible for themselves and
others to work out detailed implications of their assumptions.

The approach of Zeaman and House was to assume that each of the
component learning processes included in their schema proceeds in
accord with the same mathematical process assumed in a large class of
currently influential models for simpler forms of conditioning and
associative learning. The basic function prescribing the way in which
the subject's probability of making a response changes upon occurrence
of reward and nonreward was taken over from the probabilistic learning
models of Bush and Mosteller (1955) and Estes (1950, 1959). In the
interpretation of a discrimination learning situation involving a
number of stimulus dimensions, the notation involved in the
quantitative form of the theory becomes rather complex. Thus, as a
basis for discussing the detailed properties of Zeaman and House's
theory, it will be convenient to present the mathematical form of their
theory for a particularly simplified case.


In the following résumé let us assume that we are dealing with a
discrimination learning situation involving only two stimulus
dimensions, one relevant and one irrelevant, so that the subject must
attend to one or the other on every trial. Further, we shall assume
that the subject's probability of making a correct choice when he
attends to the irrelevant dimension is exactly 1/2 at the beginning of
the experiment and remains at that value throughout; this would be only
an approximation to the state of affairs generally assumed to obtain,
but it gains a great deal in simplicity and will not materially affect
any conclusions about the operation of the model. With these
understandings, let us define the minimum necessary notation and recast
Zeaman and House's assumptions about the discrimination learning
process in quantitative form.


Definitions:

$a_n$   Probability of attending to the relevant stimulus dimension on
        Trial n

$p_n$   Probability of selecting the correct stimulus given that the
        subject attends to the relevant dimension on Trial n

$P_n$   Probability of a correct choice on Trial n

Assumptions:

1. Probability of a correct choice on any trial is related to the other
basic quantities by the expression

$$P_n = a_n p_n + \tfrac{1}{2}(1 - a_n), \tag{1}$$

with the second term on the right expressing the assumption that the
correct choice occurs only by chance when the subject does not attend
to the relevant dimension.

2. If the subject attends to the relevant dimension on Trial n and his
choice is rewarded, then

$$a_{n+1} = a_n + \phi(1 - a_n), \tag{2}$$

and

$$p_{n+1} = p_n + \theta(1 - p_n), \tag{3}$$

where $\phi$ and $\theta$ are constants with values between 0 and 1.

3. If the subject attends to the relevant dimension on Trial n and his
choice is not rewarded, then

$$a_{n+1} = a_n - \phi a_n, \tag{4}$$

and

$$p_{n+1} = p_n + \theta(1 - p_n). \tag{5}$$

4. If the subject attends to the irrelevant dimension on Trial n and
his choice is rewarded, then

$$a_{n+1} = a_n - \phi a_n, \tag{6}$$

and

$$p_{n+1} = p_n. \tag{7}$$

5. If the subject attends to the irrelevant dimension on Trial n and
his choice is not rewarded, then

$$a_{n+1} = a_n + \phi(1 - a_n). \tag{8}$$


Equations (2)-(4) make specific the notion that response tendencies are
increased by reward and decreased by nonreward. In the former case,
probability of the given response increases by a constant fraction
($\phi$ or $\theta$) of the difference between its current value and
unity. Equations (5), (6), and (8) simply express an assumption of
symmetry together with the mathematical constraint that probabilities
of mutually exclusive and exhaustive events must add to unity. To
obtain Eq. (5), note that nonreward when attending to the relevant
dimension means that the subject has made the incorrect choice
response. In that event, the probability, $1 - p_n$, of the incorrect
choice would be decreased by the factor $1 - \theta$,

$$1 - p_{n+1} = (1 - \theta)(1 - p_n),$$

which upon rearrangement is Eq. (5).

It is assumed that reward affects the probability of attending to an
irrelevant dimension in the same way that it affects probability of
attending to the relevant dimension. Thus when attending to the
irrelevant dimension is rewarded, the probability, $1 - a_n$, of that
response is increased by the fraction $\phi$ of the difference between
its current value and unity,
$1 - a_{n+1} = 1 - a_n + \phi[1 - (1 - a_n)]$; this last expression,
rearranged, is Eq. (6).

For descriptive purposes, the constants $\theta$ and $\phi$ appearing
in the basic learning functions may simply be regarded as rate
parameters, the value of each obtaining for a given subject and problem
being jointly determined by characteristics of the subject and
characteristics of the task. From the standpoint of stimulus sampling
theory (Estes, 1959; Neimark & Estes, 1967) these parameters would be
interpreted as stimulus sampling probabilities. It would be assumed
that there is available at the onset of each experimental trial a set
of stimulus elements which are associated with the attentional
responses. The parameter $\phi$, then, represents the probability that
any one of these elements is sampled (perceived) by the subject, and
thus enters into the learning process, on any trial. Similarly, once
the subject attends to the relevant dimension, a second set of
(“discriminative”) stimulus elements becomes available, and the
parameter $\theta$ represents the probability that any one of these
elements is sampled on a trial when the subject so attends.

The reader may note that there are two items missing from the list of
basic learning functions, namely those representing changes in
probability of the choice response when the subject attends to the
irrelevant dimension. The reason for this omission is that in the
simplified revision of the model presented here, it is assumed that
probability of choice of either stimulus when the subject attends to
the irrelevant dimension remains constant at 1/2 (whereas in the full
model it would fluctuate randomly around an average value of 1/2).
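The trial-by-trial operation of the simplified model can be made
concrete with a short simulation. The sketch below is an illustration
written for this passage, not a program used by Zeaman and House: it
assumes the reading of Eqs. (2)-(8) given by the verbal assumptions
above (attention rate φ, choice rate θ), and the function name,
argument names, and use of Python's `random` module are hypothetical.

```python
import random
from typing import List, Optional, Tuple


def simulate_subject(a0: float, phi: float, theta: float, n_trials: int,
                     p0: float = 0.5,
                     rng: Optional[random.Random] = None
                     ) -> Tuple[List[int], float, float]:
    """Simulate one subject in the simplified two-dimension model.

    Returns the sequence of first-choice outcomes (1 = correct) and the
    final values of a (attention to the relevant dimension) and p
    (correct choice given attention to the relevant dimension).
    """
    rng = rng or random.Random()
    a, p = a0, p0
    outcomes: List[int] = []
    for _ in range(n_trials):
        if rng.random() < a:              # attends to relevant dimension
            correct = rng.random() < p
            # Eqs. (3) and (5): p moves toward 1 after either outcome.
            p = p + theta * (1.0 - p)
            if correct:
                a = a + phi * (1.0 - a)   # Eq. (2): rewarded
            else:
                a = a - phi * a           # Eq. (4): not rewarded
        else:                             # attends to irrelevant dimension
            correct = rng.random() < 0.5  # choice is at chance
            if correct:
                a = a - phi * a           # Eq. (6): irrelevant attending rewarded
            else:
                a = a + phi * (1.0 - a)   # Eq. (8): not rewarded
            # Eq. (7): p is left unchanged.
        outcomes.append(1 if correct else 0)
    return outcomes, a, p
```

Running many such simulated subjects with a low starting value of a and
averaging their protocols, forward or backward from criterion, is one
way to explore the model's predictions without further algebra.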

For any experimental application, it is possible by means of Eqs.
(2)-(8), to derive expressions for the expected, or average, change in
probability of the attentional response from any trial to the next. The
procedure is to multiply the new value of this probability resulting
from each possible outcome of the trial by the probability of that
outcome and sum the quantities so obtained. Thus, with probability
$a_n p_n$, the subject attends to the relevant dimension and chooses
the correct stimulus object on Trial n, in which case the new
probability of attending to the relevant dimension is, by Eq. (2),
$a_n + \phi(1 - a_n)$. Proceeding similarly through all possibilities,
one obtains

$$
\begin{aligned}
a_{n+1} &= a_n p_n [a_n + \phi(1 - a_n)] + a_n (1 - p_n)(a_n - \phi a_n) \\
&\quad + \tfrac{1}{2}(1 - a_n)[a_n + \phi(1 - a_n)] + \tfrac{1}{2}(1 - a_n)(a_n - \phi a_n) \\
&= a_n + \tfrac{\phi}{2}(1 - a_n) - \phi a_n (1 - p_n).
\end{aligned}
\tag{9}
$$

To be precise, the term on the left-hand side of Eq. (9) should be
altered to indicate that it represents an average value, but we shall
take this to be understood and refrain from introducing any additional
notation.
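The algebra leading to Eq. (9) can be checked numerically. The
following Python sketch, offered only as a verification aid with
hypothetical function names, computes the expected new value of a by
summing over the four possible trial outcomes of the simplified model
and compares it with the reduced form of the equation.

```python
def expected_attention_by_cases(a: float, p: float, phi: float) -> float:
    """Expected a on the next trial, summed over the four outcomes."""
    increased = a + phi * (1.0 - a)   # Eqs. (2) and (8)
    decreased = a - phi * a           # Eqs. (4) and (6)
    return (a * p * increased                  # relevant, rewarded
            + a * (1.0 - p) * decreased        # relevant, not rewarded
            + (1.0 - a) * 0.5 * decreased      # irrelevant, rewarded
            + (1.0 - a) * 0.5 * increased)     # irrelevant, not rewarded


def expected_attention_reduced(a: float, p: float, phi: float) -> float:
    """Reduced form: a + (phi/2)(1 - a) - phi*a*(1 - p)."""
    return a + (phi / 2.0) * (1.0 - a) - phi * a * (1.0 - p)


# The two expressions agree over a grid of parameter values.
grid = [i / 10.0 for i in range(11)]
max_gap = max(
    abs(expected_attention_by_cases(a, p, phi)
        - expected_attention_reduced(a, p, phi))
    for a in grid for p in grid for phi in grid
)
```

The agreement reflects the fact that the terms in a alone sum to a,
while the terms in φ collapse to the two shown in the final line of the
equation.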


In the same way, we can take account of the changes consequent on all
possible outcomes of Trial n and write an expression for the expected
change in probability of choosing the correct stimulus given that the
subject attends to the relevant dimension.

$$
\begin{aligned}
p_{n+1} &= a_n p_n [p_n + \theta(1 - p_n)] \\
&\quad + a_n (1 - p_n)[p_n + \theta(1 - p_n)] + (1 - a_n) p_n \\
&= p_n + \theta a_n (1 - p_n).
\end{aligned}
\tag{10}
$$

A comparable expression can be written for correct response
probability on any trial as a function of the attention and choice
probabilities on the preceding trial; but even in this simplified form
of the model this recursion proves too cumbersome to be instructive. We
can, however, gain some insight into the learning process represented
in the model by examining the separate recursions for the attending and
choice responses.
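One informal way to see how the two recursions interact is to iterate
them together and evaluate Eq. (1) on each trial. The Python sketch
below does this under an explicit simplifying assumption that is not
part of the text: the mean values of a and p are treated as if they
formed a deterministic system, which ignores any correlation between
the two quantities across subjects. The function name and the parameter
values in the example are hypothetical.

```python
from typing import List


def mean_learning_curve(a0: float, phi: float, theta: float,
                        n_trials: int, p0: float = 0.5) -> List[float]:
    """Approximate mean probability of a correct choice on each trial.

    Iterates the reduced forms of Eq. (9) and Eq. (10), treating the
    mean values of a and p as a deterministic system.
    """
    a, p = a0, p0
    curve: List[float] = []
    for _ in range(n_trials):
        curve.append(a * p + 0.5 * (1.0 - a))                        # Eq. (1)
        a_next = a + (phi / 2.0) * (1.0 - a) - phi * a * (1.0 - p)   # Eq. (9)
        p_next = p + theta * a * (1.0 - p)                           # Eq. (10)
        a, p = a_next, p_next
    return curve


# Example with a low initial probability of attending to the relevant
# dimension; the curve starts at chance and approaches 1.
example_curve = mean_learning_curve(a0=0.05, phi=0.2, theta=0.2,
                                    n_trials=200)
```

Because the gain in p on each trial is proportional to a, and the gain
in a grows as p rises, each quantity speeds the growth of the other,
which is the mutual facilitation described verbally earlier in this
section.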

Considering first Eq. (9), we note that if $p_n = 1/2$, which would be
the case on the average at the beginning of training on a new problem,
the equation reduces to

$$
\begin{aligned}
a_{n+1} &= a_n + \tfrac{\phi}{2}(1 - a_n) - \tfrac{\phi}{2} a_n \\
&= a_n + \phi\left(\tfrac{1}{2} - a_n\right).
\end{aligned}
$$

If the average probability of attending to the relevant dimension were
at chance, it would remain there; if not, it would change in that
direction by a constant fraction of the difference between its current
value and 1/2. In general, whether $a_n$ increases or decreases on the
average from its current value depends on the associated value of
$p_n$, but regardless of the current value, the change will be positive
if $p_n$ is sufficiently large. In the limit, if $p_n = 1$, Eq. (9)
reduces to

$$a_{n+1} = a_n + \tfrac{\phi}{2}(1 - a_n)$$

and we see that $a_n$ is driven toward unity.


Turning to Eq. (10), we note first of all that the average change in
probability of a correct choice, given attention to the relevant
dimension, is always positive, provided only that probability of
attending to the relevant dimension is greater than zero. On every
trial, the difference between the current mean value of $p_n$ and unity
is reduced by a fraction, the value of the fraction being directly
related to $a_n$.
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curve f : . 
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since each further 
rate of growth of the other. 
Thus the model predicts the form of the 


observed by Zeaman and House, and, further, 
au with an ultimately steeply rising 
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a low initial value of a,. It is for this 
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trials varies from an exceedingly flattened ogive which rises 
no more than 2 or 3 percentage points over the whole series to an
exceedingly steep and sharply decelerated curve that passes 90% correct
by about Trial 50. Similar variations in form and steepness of learning
functions are observed when the number of irrelevant stimulus dimensions
is varied while other parameters are held constant. Within any one of
these families of curves, all of the functions rise more steeply as the
θ values are increased, but rather large changes produce modest effects.

An interesting, and so far as I know novel, implication of the model
is the phenomenon which Zeaman and House term the “reversal
midplateau.” Suppose that in a form-color discrimination subjects
during the original training series have learned always to choose the
red rather than the green stimulus. If, now, the contingencies are
changed so that the choice of green is rewarded, the learning curve
in terms of probability of choosing green is predicted to rise very
rapidly from its initial value of near zero, reaching the chance level in
a very few trials, then to level off for a few trials before again
accelerating and proceeding toward asymptote. The reason for the
midplateau is that, [...] the probability of the observing response for
the formerly relevant dimension decreases on every [...] other things
equal, [...]

The analysis of backward learning curves for simulated learners
yields a pattern that we should expect from our examination of the
simplified form of the model. Namely, variation in the initial
probability of attending to the relevant dimension generates a family of
curves which differ in the length of the initial chance-level plateau but
which exhibit similar rates of rise to asymptote once they depart from
the chance level.

[...] as the θ value for the [...]
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One of the principal sources of evidence concerning Zeaman and 
House’s discrimination theory has been experiments in which subjects
learn an original discrimination and then are transferred to a new
discrimination in which some of the stimulus-reward contingencies have
been changed. A prediction of particular interest is that subjects
should show positive transfer, that is, facilitation of learning, on a
relatively difficult discrimination if they have previously learned an
easier discrimination involving the same stimulus dimensions. The reason
is that the difficulty of the difficult discrimination must reside at
least in part in a low initial probability of attending to the relevant
stimulus dimension. If this attentional response could be learned during
previous training on an easier discrimination, then the difficult
discrimination would, in effect, be partially mastered before the subject
had begun training on it and thus the performance curve for correct
choices should be substantially accelerated.

House and Zeaman (1960b) report a pertinent experiment conducted 
with three groups of retarded children and a color-form discrimination. 
[...] found that for these children, as indeed for [...]
color-form discriminations are much more difficult [...]
a two-dimensional pattern than [...] three-dimensional object.
Thus they were able to use pre[...] discrimination to produce relatively


rapid learning of the response of attending to a relevant stimulus di- 
mension and then to test transfer to a more difficult two-dimensional 
pattern discrimination. Three groups of children matched on MA were 
employed. The primary experimental group first learned a color-form 
discrimination, for example, a three-dimensional red cross
versus a three-dimensional blue triangle, and second, a two-dimensional
discrimination with the same stimulus dimensions involved, for example,
a flat red cross versus a flat blue triangle. A
„on the second problem. A change-of- 
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ues representing each 
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children being able to learn th 
the backward learning curve showed a v 
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level of 50% correct responding before a final relatively steep rise to- 
ward asymptote. As predicted, the experimental group showed marked 
facilitation of learning on the difficult problem: 13 of the 19 children 
reached criterion on the first, three-dimensional discrimination, and all 
of these learned the subsequent two-dimensional discrimination within
20-30 trials. Of special interest is the change-of-cues group, which
showed marked positive transfer, although not quite to the level of the
primary experimental group. Two-phase theories which assume an initial
phase of observing response learning, but with responses becoming
associated with specific cues rather than with dimensions, would not,
without special auxiliary assumptions, predict the substantial
facilitation observed under the change-of-cues condition.

The theoretical picture is not entirely clear, however. Unfortunately,
it must be noted that before full weight can be given to this favorable
outcome, results will be needed for a third control group which is
trained on a three-dimensional followed by a two-dimensional
discrimination but with the relevant dimension, as distinguished from the
positive cue, changed between the first problem and the second. In the
absence of this control, the possibility must be considered that the
positive transfer observed in the case of the change-of-cues group in the
House and Zeaman study might have been based, not on learning of an
attentional response, but on less specific factors. Examples of the
latter are adjustment to the experimental situation and the previous
experience of success with a problem generally similar to the final
difficult discrimination.


There seem to be no studies conducted especially to investigate the
prediction of a reversal midplateau, but a few relevant data are
reported by Zeaman and House (1963). Reversal-learning curves are
shown for two groups of retardates who learned a position discrimination
followed by a reversal; the two groups differed with respect to the
amount of training in the first discrimination, and presumably in the
probability of attending to the relevant dimension when the reversal was
initiated. In both cases the reversal curves show some evidence of the
midplateau, with the effect being considerably more accentuated for the
group which had the lesser degree of training on the initial
discrimination. On the basis of some simulation data, Zeaman and House
suggest that not only the occurrence of the midplateau but the direction
of the difference between the two groups was in accord with the
predictions from their model.

The matter of the midplateau demands further investigation in a
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number of respects. First, it would be of much interest to know 
whether the same phenomenon can be predicted from other extant 
discrimination theories, as for example those of Atkinson (1958) and 
Lovejoy (1965). If not, then the phenomenon would be of special in- 


terest with regard to differentiating between the alternative theories. 
On the empirical side, the available fragments of evidence are scarcely
a basis for any firm conclusions. More systematic studies are needed to 
determine the reliability of the effect and the conditions under which it 
occurs. With regard to the last point, it might be mentioned that sim- 
ilar plateaus have been observed under other conditions (Blehert, 
1965). Since Blehert’s experimental situation involved a choice among 
five response alternatives on each trial, rather than only two as in 
Zeaman and House’s studies, it will be necessary to determine by suitable
computer simulations whether Zeaman and House’s model can
predict plateaus in original discrimination learning under the condi- 
tions of Blehert’s multiple-choice discrimination problem. 

An aspect of discrimination learning which has recently received 
intensive study at a number of levels, retardates, normal children, 
monkeys, and rats, has to do with transfer of training as a function of 
relationships between the original discrimination problem and the 


transfer problem. Principal cases in addition to reversal learning are
intradimensional and extradimensional shifts. The former denotes the
case in which, following learning of a discrimination, a new problem is 
given involving the same relevant and irrelevant dimensions but with a 
new choice of cues on the relevant dimension. For example, if on an 
original color-form discrimination blue was the positive and yellow the 
negative cue, then on an intradimensional shift the subjects might be 
switched to a new problem with the same form cues irrelevant but with 


now green positive and red negative. The term “extradimensional
shift” refers to a case in which the relevant cues in a new problem fall
on a different dimension from that of the original problem. In the
example, one would speak of an extradimensional shift if the subjects
were shifted to a new problem with blue and yellow irrelevant, but 
with, say, circles positive and triangles negative. 

Analyzed in terms of Zeaman and House’s model, the task of the 
subject is quite different in different types of shifts. In the learning of 


an original discrimination, the subject must learn the observing
response of attending to the relevant dimension and then must learn to
associate the correct instrumental responses to the cues of the relevant
dimension. Ordinarily the initial probability of the correct observing
response is low and that of the correct instrumental response at
chance (.5 in a two-choice discrimination problem), and both of these
probabilities must be carried to unity before the problem is finally
mastered. Similarly, following an extradimensional shift, the initial
probability of attending to the now relevant dimension will be low and
the probability of the correct instrumental response at chance, so, in
general, it must be predicted that rate of learning an extradimensional
shift should be about equal to that of learning the original
discrimination. Following an intradimensional shift, however, the
probability of the correct observing response will be high, since the
subjects already have learned to attend to the relevant dimension which
has not been changed, and learning of a new problem involves only
carrying the probability of the correct instrumental response from .5 to
unity.


Thus on an intradimensional shift learning should proceed more rapidly
than in either of the other cases mentioned. Further, the initial
plateau in the backward learning curve should be abbreviated and, if
the final level of performance on the original discrimination is
sufficiently high, the initial plateau of the intradimensional shift
problem should be eliminated completely. On a reversal problem, again the
initial probability of the relevant observing response will be high, but
the probability of the correct instrumental response will be lower than
chance. Thus reversal learning should proceed more slowly than an
intradimensional shift, but with the initial plateau abbreviated as
compared to original learning. Further, on the assumption that learning
of an observing response usually proceeds much more slowly than
learning of the correct instrumental response, reversal learning should
generally be more rapid than original learning.

A more extensive discussion of investigations of the various types of
transfer problems will be given below. At this point we might mention
just one of the earlier relevant studies, an investigation by House and
Zeaman (1962) which they analyzed in some detail in terms of the
model. Three matched groups of retarded children were given original
training on an object discrimination problem with one relevant
dimension, either color or form. Following original learning with either
form or color relevant, subjects were transferred to a reversal, an
intradimensional shift, or an extradimensional shift.

In good agreement with the qualitative analysis given above,
learning of the second problem was very rapid for the intradimensional
shift condition, only slightly less rapid for the reversal condition, and
much slower, with a prolonged initial plateau near the chance level, for
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the extradimensional shift condition. Further, with plausible choices of 
parameter values, simulated data for stat children computed on the 
basis of the model yielded a set of backward learning curves very sim- 
ilar to those observed. It was noted also that learning of the extradi- 
mensional shift problems was somewhat more rapid than that of the 
original discriminations. House and Zeaman indicate that this finding 
might follow from their model given certain assumptions concerning 
relative strengths of competing irrelevant observing responses in the 
original and shift situations. However, before this aspect of the
analysis can be properly evaluated, more adequate control data are needed
to show whether some improvement might occur from one problem to
another owing to nonspecific factors, as, for example, motivational
level, understanding of instructions, or habituation to novel features of 
the experimental situation. 
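
The qualitative analysis of the several shift conditions amounts to a statement about starting values of the attention and choice probabilities, and this can be made concrete in a small sketch. The starting values below are hypothetical, chosen only to mirror the verbal description (attention high or low, choice at chance or below chance). The recursion is the same simplified two-dimension average recursion, which is my assumption and not House and Zeaman's simulation of stat children.

```python
def percent_correct(a0, p0, theta=0.2, trials=40):
    """Mean probability correct per trial from starting values (a0, p0).

    Assumes one relevant and one irrelevant dimension; attending to the
    irrelevant dimension yields a correct choice half the time.
    """
    a, p = a0, p0
    curve = []
    for _ in range(trials):
        curve.append(a * p + (1.0 - a) * 0.5)
        # Both right-hand sides use the values from the current trial.
        a, p = (
            a + theta * (a * (p - a) + (1.0 - a) * (1.0 - 2.0 * a) / 2.0),
            p + theta * a * (1.0 - p),
        )
    return curve


# Hypothetical starting values (attention a0, choice p0) for each condition.
CONDITIONS = {
    "original": (0.10, 0.50),          # attention low, choice at chance
    "intradimensional": (0.95, 0.50),  # attention high, choice at chance
    "reversal": (0.95, 0.05),          # attention high, choice below chance
    "extradimensional": (0.05, 0.50),  # attention low, choice at chance
}

curves = {name: percent_correct(a0, p0) for name, (a0, p0) in CONDITIONS.items()}

if __name__ == "__main__":
    for name, curve in curves.items():
        print(f"{name:>17}", [round(c, 2) for c in curve[::8]])
```

With these values the intradimensional curve lies above the reversal and extradimensional curves throughout. The sketch tracks group means only, so it cannot show the plateaus of individual backward learning curves, and it says nothing about the nonspecific transfer factors just mentioned.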

Although the successful application of the attentional model to these 
transfer problems has led to a confirmation of qualitative predictions 
which seems promising for the theory, considerable additional experi- 
mental analysis will be needed to determine the extent to which the 
theory can claim support. An important alternative interpretation of 
this last experiment which needs to be ruled out is that the positive 
transfer from original discrimination learning to the new problem fol- 
lowing an intradimensional shift is a result, not of increased attention 
to the relevant dimension, but of adaptation to irrelevant stimuli which 
are present in both the original and transfer problems. As the experi- 
ment was done, the intradimensional shift and the reversal condition 
would evidently have an advantage over the extradimensional condi- 
tion in this respect. The more recent literature on intra- and extradi- 
mensional transfer will be examined in a later section in order to deter- 
mine how well confoundings of this sort, inevitably present in early 
experiments on a given topic, have been resolved. 

Another application of the discrimination theory which Zeaman and 
House feel yields confirmatory evidence has to do with the introduction 
of a novel stimulus at a particular stage of discrimination learning. In 
one experiment, for example (House & Zeaman, 1958), the subject was 
presented with a single stimulus, which might be either rewarded or 
nonrewarded, on the first trial of an object discrimination series. Then, 
on the second and following trials, this stimulus was paired with an- 
other. The result was that regardless of the condition of reward or 
nonreward on the first trial, the subjects strongly preferred the new 
stimulus on Trial 2. This preference was interpreted as an instance of 
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a familiar tendency on the part of both human and animal subjects to 
choose a novel over a familiar stimulus under a wide variety of condi- 
tions when other factors are approximately equal.

On the assumptions that novelty can be considered to be a discriminable
property of stimuli, and that novelty enhances the probability
of attention to the novel stimuli, Zeaman and House proposed that
their model can handle this type of phenomenon. But here we seem to
run into a conceptual difficulty, for the theory of Zeaman and House
differs from a number of other observing-response theories in assuming
that the observing response relates to dimensions rather than to cues.
One should observe effects of novelty only when new dimensions are
introduced into a situation, not when new cues appear within
dimensions already represented.
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do not. The same is true of the analogous pattern found for discrimina- 
tion problems differing in difficulty (Zeaman & House, 1963; Shepp & 
Zeaman, 1966). The extent to which these results should be taken as 
support for Zeaman and House’s theory and of the implications fol- 
lowing from it depends in part on the question of whether the same 
pattern of data could be predicted from other reasonable assumptions. 
This possibility requires careful analysis, which we defer to a later
section.

If Zeaman and House’s conclusion regarding the invariance of 
learning parameters is sound, one might expect that measures of intel- 
ligence would be relatively independent of rate of learning in situations 
which do not involve discriminations and thus where attention might 
be a less important factor. Whether there are any learning situations 
not involving discriminations is a moot point, but even if this state of 
affairs is an idealization, it should be approximated much more closely 
by simple conditioning than by discrimination experiments. The litera- 
ture on measures of conditioning in relation to degree of mental retar- 
dation (see Denny, 1964) is fairly encouraging for Zeaman and House’s 
position, for in general rates of conditioning have been found to vary 
little over a wide range of intelligence levels. Much the same appears 
to be true for simple motor-learning tasks, which again would be as- 
sumed to be less sensitive to attentional factors than discrimination or 
most types of verbal learning. 

What is one to make of the overwhelming importance of attention in 
Zeaman and House’s formulation? They suggest that individuals of 
differing intelligence levels vary primarily in their probabilities of at- 
tending to relevant stimulus dimensions at the start of any new 
problem. But it should be noted that in their model the probabilities of
attending to various dimensions in any situation must sum to unity. 
Therefore one cannot, within their framework, meaningfully speak of a 
general trait of attentiveness comparable to such notions as degree of 
arousal or activation. From the first trial on any learning problem, the 
individual is always attending to some dimension or other. Therefore 
we must conclude that if the analysis is correct, the more intelligent 
individuals are those who are more likely to be attentive to the dimen- 
sions which are relevant in the kinds of problems used to test learning 
ability. 
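
The constraint that attention probabilities sum to unity can be illustrated with a minimal sketch. It assumes a linear-operator update in which the attended, rewarded dimension gains strength and all others are scaled down proportionally. The function name, the value of theta, and the three-dimension example are hypothetical.

```python
def reinforce_attention(probs, attended, theta=0.2):
    """Strengthen attention to one dimension; scale the others down.

    probs is a list of attention probabilities summing to 1. The update
    is an assumed linear operator, used here only for illustration.
    """
    return [
        q + theta * (1.0 - q) if i == attended else (1.0 - theta) * q
        for i, q in enumerate(probs)
    ]


if __name__ == "__main__":
    before = [0.1, 0.2, 0.7]  # e.g., color, form, position (hypothetical)
    after = reinforce_attention(before, attended=0)
    print(after, sum(after))
```

Any gain for one dimension is exactly offset by losses for the others, which is why the model leaves no room for a general trait of attentiveness.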

On the face of it, at least, this notion seems hard to accept. One
logical implication is that even the most severely mentally retarded
individual must be superior to normals on some types of problems.
Zeaman and House (1963, p. 217) cite a fragment of evidence implying
that this might be so: a report by Weir and Stevenson (1959) that
among normal children, older children did more poorly than younger
ones on a simple discrimination problem involving pictures of animals
as stimuli. Instances of this sort are not very convincing,
since it seems likely that the direction of the observed difference could
readily have been reversed by a simple change in the way of explaining
the task to the children.

A strong implication of Ze 
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aman and House’s interpretation would be 
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uperior intelligence if placed in an environment 
dimensions to which he 


of modifying attention to 
specific technique would 
sfer paradigm, as for ex- 
initial training of discrimi- 
rs, numbers, and the like. Another would be directing children’s
attention to relevant features of stimuli by instructions, by having
them touch or trace outlines of forms, as the forms of letters during
reading, and by arranging to associate rewards closely with relevant
cues. Zeaman and House note that techniques of this sort have, in fact,
been used by individuals (e.g., Fernald, 1943; Montessori, 1912;
Strauss & Lehtinen, 1947) who report unusual success in remedial
training of retarded or brain-injured children.

There can be no doubt that techniques that can attrac 
of the learner to relevant features of the task 
useful. Whether extensive reliance on such tec 
major changes in the ability of mentally reta 
normal levels of performance i 
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well into Zeaman and House's analysis, but are certainly open to other 
interpretations. The similarity in the families of backward discrimina- 
tion learning curves for discrimination problems differing in relevant 
dimensions (those for color being greatly retarded relative to form, etc.) 
and the families obtained for groups of children of different MAs on 
the same problems is suggestive. There do indeed seem to be grounds 
for hypothesizing that the rate at which the learner comes to attend to 
the relevant dimension of a problem is related in an important way to 
differences in difficulty among problems and differences in ability 
among learners. 

However, the importance of the initial probabilities of attentional 
responses, as distinguished from differences in the parameters gov- 
erning rate of learning of these responses, is so far as I know peculiar 
to Zeaman and House’s theory and distinguishes their position from all 
other currently influential approaches to discrimination learning. 
Within their model differences in the rate of learning of relevant atten- 
tional responses do not produce the same kinds of differences in back- 
ward learning curves as differences in the initial probabilities of atten- 
tional responses. One wonders whether this feature of the theory is to 
be considered a minor idiosyncrasy, which could perhaps be elimi- 
nated by some revision in the assumptions that would not greatly 
change the implications of their model in other respects, or whether it 
is a necessary consequence of Zeaman and House’s basic assumptions 
which could not be altered without major revision of the theory. Some 
theoretical analyses bearing on this point will be reported in a later 
section after we have completed a review of relevant empirical studies 
and have more systematically compared Zeaman and House’s model 
with other discrimination theories. 


4. discrimination shift studies 


Because the consequences of various types of shifts between original 
discrimination and transfer problems are widely taken to be of a spe- 
cial diagnostic value with regard to types of discrimination theories, it 
seems useful to review this literature briefly. For a more extensive re- 
view with special attention to methodological problems the reader is 
referred to Shepp and Turrisi (1966). 

Perhaps the most pervasive conclusion growing out of this literature 
is the consistent advantage in speed of discrimination learning on
intradimensional as opposed to extradimensional shifts. This relationship
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seems well established for animals as well as for normal and retarded 
human subjects. Further, it seems quite well established that the ad- 
vantage of the intradimensional shift increases with the degree of over- 
training on the original problem. There is some indication that speed of 
learning an extradimensional shift, in contrast, decreases with degree
of overtraining, but this evidence is not as consistent. Taken together,
this set of findings is widely accepted as establishing the necessity for
a two-stage discrimination theory involving either mediating responses
or selective attention (Shepp & Turrisi, 1966, p. 117).
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in this type of study assure that there 
cific stimulus-response associations from 
It is conceivable that the 
cues in the transfer problem is for 
d by previous training with the same rele- 
ption that would be required to interpret the 
5 adaptation, of. cues theory (1955). However, 
er extradimensional shift appe 
he relevant cues in the 
(Campione, Hyman, & Zeaman, 1965; Dicke 
mas, 1964). There appears to re 
than that something which the s 
crimination training, in addition 
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In some respects the most parsimonious assumption is that the indi- 
vidual learns to observe, or attend to, the specific relevant cues in the 
course of learning any discrimination problem. This idea is usually 
credited to Spence (1936) and it was first given major emphasis in the
discrimination theory of Wyckoff (1952) which, however, was never 
developed fully enough to yield specific interpretations of the various 
types of experiments we are considering. In order to account for the 
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tson, 1967; Shepp & Ei- 
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shift effects, the specific observing-response theories would have to be 
augmented by an assumption that transfer of these observing re- 
sponses from one problem to another occurs more readily when the 
relevant cues fall on the same dimension than when they fall on dif- 
ferent dimensions. In this line of thinking, the notion of a stimulus 
dimension might be taken simply as the experimenter's designation of 
a set of stimuli within which generalization among members is gener- 
ally high as compared to generalization from these to stimuli outside 
the set. 

The closely related, though logically distinct, approaches of Zeaman 
and House (1963), Lovejoy (1966), and, perhaps, Sutherland (1959) take 
the notion of a stimulus dimension as primary and assume that the first 
stage in mastery of a discrimination problem is to learn to attend to the 
relevant dimension. Then facilitation is naturally expected if the 
learners transfer to a problem involving the same relevant dimension, 
since they will be in a position to bypass the first stage of learning of 
the new problem. The notion of a stimulus dimension, or of an analyzer 
(Sutherland, 1959), has not been sufficiently fully explicated so that I,
at least, can see what evidence might distinguish between two-stage 
discrimination theories of the specific observing-response type and 
those assuming attention to dimensions. Thus for present purposes 
there seems nothing to do but to group these together under one desig- 
nation, say, two-stage attentional theories. 

The most popular alternative to an attentional theory, among two- 
stage conceptions, has been the mediating-response hypothesis of 
Kendler and Kendler (1962). Again the task of critical analysis is handi- 
capped by the fact that no one theory of this type has been spelled out 
in adequate detail. As I understand it, the Kendlers’ conception differs
from that of Zeaman and House primarily in that, according to the 
mediational theory, not only does discrimination learning involve two 
stages, but also the final state of discrimination learning involves a 
chain of two or more responses. Rather than associating instrumental 
responses directly with the relevant cues in a situation, the learner is 
assumed first to associate some mediating response, presumably a 
verbal label in the case of human subjects and perhaps some more 
primitive counterpart in case of lower organisms, with the relevant cue 
and then to associate the correct instrumental response with the stim- 
ulus feedback from the mediating response. Thus, in a color-form dis- 
crimination with color relevant, the subject might learn to say to him- 
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self the word “red” whenever 


form, and then associate the 
word “red.” 



Just as in the case of specific observing response theories, the 
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shifts. Further there is an implication that phylogenetic and develop- 
mental stage of the organism would be of considerable importance 
since presumably adult human beings would be most likely to have 
suitable mediating responses available for many types of stimuli, at 
least to the extent that these involve language. The advantage of
intradimensional over extradimensional shifts would be expected to increase
as a function of age, paralleling the development of language skills, to 
be especially impaired in the mentally retarded, and to be less ap- 
parent in animals. 
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for example to say the word “red.” whenever he saw a red triangle (or 
a red square); the correct instrumental response would then be asso- 
ciated with stimulus feedback from the word “red,” and after the 
chain was established he would always respond correctly to a red 
triangle. 

But, if the mediational assumption is simply embedded in an asso- 
ciation theory, or added to the usual assumptions of Hull-Spence 
theory (as the Kendlers seem to have done), what is the basis for pre- 
dicting that the individual will come to emit the mediating response 
“red” only in the presence of red stimuli? Why should this mediating 
response not be associated also with form cues just as would the in- 
strumental response in a one-stage model? It appears that an atten- 
tional theory, or the equivalent, is required in order to account for the 
selective association of mediating responses with relevant cues, and 
thus that the mediational conception is not an alternative or a substi- 
tute for an attentional theory. 

The other side of the coin is that the kinds of responses which one 
would expect to fill the role of mediators certainly do occur. Anyone 
who has watched a young child learning a color-form discrimination 
will have noted the frequency with which the child spontaneously 
squeals aloud “It’s the red one,” as he reaches for a red stimulus ob- 
ject. What is the function of these “labeling” responses? 

A possibility, though most psychologists seem to discount it on intui- 
tive grounds, is that they simply represent epiphenomena. A second 
possibility, and one for which a reasonable case can be made, is that 
the “labeling responses” do not mediate anything in the sense of 
serving as links of a chain leading from stimuli to instrumental re- 
sponses but rather serve functions of facilitating memory storage and 
the direction of attention. If a learner, who on a given trial has per- 
ceived a relation between a certain stimulus property and the reward 
of an instrumental response based on it, has available a labeling re- 
sponse corresponding to the stimulus property, he will be in a position 
to rehearse this relationship following the trial and thus to facilitate 
recall of this relationship at the start of a later trial in which the same 
stimulus pattern occurs. Then, on the later trial, if the labeling re- 
sponse is recalled upon presentation of the stimulus, this response 
could serve to direct attention to the relevant cue, thus increasing the 
likelihood that the instrumental response would be based on this cue 
rather than an irrelevant one. 

If this analysis is sound, it is fruitless to think of one-stage
associational theories, attentional theories, and mediational theories
as alternative models for discriminational learning and to attempt to
discover from experimental results which is correct. Rather, we might
better think of associative learning, selective attention, and mediating
responses as different mechanisms, all of which can contribute to
solution of discrimination problems and which may [...] combinations in
organisms at different developmental levels. Before much progress can be
expected toward an overall theory, we need to determine empirically
which mechanisms are operating in individuals at various developmental
or ability levels in various types of learning situations.

[...] instrumental response itself. In the classical conditioning situation the
same would not be true, since occurrence of a response such as saliva- 
tion or blinking of the eye does not depend upon, and in some cases 
may indeed be incompatible with, continued attention to some aspect 
of the conditioned stimulus. 

In discrimination situations other than classical conditioning, the 
evidence seems quite uniformly to indicate that over a wide range of 
organisms including rats, monkeys, and both mentally retarded and 
normal human subjects, an attentional mechanism must be operative 
since intradimensional shifts are consistently easier than extradimen- 
sional shifts. It does not seem possible at present to be much more 
specific as to just what type of attentional mechanism is implicated, for 
the range of conditions that has been studied experimentally is se- 
verely limited. Most conspicuously, almost the entire relevant litera- 
ture is limited to the visual discrimination problem, and almost exclu- 
sively to manipulations of a very few familiar dimensions such as form, 
color, brightness, and position. The one exception that has come to my 
attention is a study by Brown and Smith (1967), showing faster intradi-
mensional than extradimensional shift for a tactual-kinetic discrimination.

The fact that the advantage of intradimensional over extradimen- 
sional shifts has been found quite consistently to increase with degree 
of overtraining on the original discrimination can readily be interpreted 
within Zeaman and House’s theory, though it could scarcely be said to 
be predicted. If the response of attending to the relevant dimension is 
once overlearned to the point where it occurs with approximately prob- 
ability 1 on every trial, further training would be wasted in the sense of 
contributing neither to performance on the given discrimination nor to 
positive transfer on new problems.* 
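
The relation between the two stages during training and overtraining can be illustrated numerically. What follows is a minimal sketch and not Zeaman and House's own equations: it assumes a deterministic mean-value approximation in which the probability of attending to the relevant dimension (`p_obs`) and the probability of the correct instrumental response given such attention (`p_inst`) each grow by a simple linear operator. The rate parameters `theta_obs` and `theta_inst`, the chance level of 0.5, and the criterion of 0.9 are all hypothetical choices made for illustration.

```python
def train(p_obs, p_inst, theta_obs, theta_inst, n_trials):
    """Advance both stages by n_trials of training (mean-value sketch)."""
    for _ in range(n_trials):
        # Assumption: instrumental learning occurs only on trials when the
        # relevant dimension is attended, hence the weighting by p_obs.
        p_inst += theta_inst * p_obs * (1.0 - p_inst)
        # Assumption: attention drifts toward the relevant dimension.
        p_obs += theta_obs * (1.0 - p_obs)
    return p_obs, p_inst


def p_correct(p_obs, p_inst):
    """Probability of a correct choice; chance (0.5) when not attending."""
    return p_obs * p_inst + (1.0 - p_obs) * 0.5


def train_to_criterion(theta_obs, theta_inst, criterion=0.9, max_trials=10_000):
    """Train from chance on both stages until p_correct reaches criterion."""
    p_obs, p_inst, trials = 0.5, 0.5, 0
    while p_correct(p_obs, p_inst) < criterion and trials < max_trials:
        p_obs, p_inst = train(p_obs, p_inst, theta_obs, theta_inst, 1)
        trials += 1
    return p_obs, p_inst, trials


# Hypothetical rates: the observing response is learned more slowly.
obs_c, inst_c, n = train_to_criterion(theta_obs=0.1, theta_inst=0.3)
obs_o, inst_o = train(obs_c, inst_c, 0.1, 0.3, n_trials=20)  # overtraining
print(f"at criterion ({n} trials): p_obs={obs_c:.3f}, p_inst={inst_c:.3f}")
print(f"after overtraining:       p_obs={obs_o:.3f}, p_inst={inst_o:.3f}")
```

Under these assumptions the observing probability lags behind the instrumental probability when the criterion is first met, and overtraining moves it closer to unity, after which further trials change very little.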

It is quite possible, however, that if the criterion of mastery of an
original discrimination is not too stringent it may be met at a point
when the probability of the correct observing response is still
somewhat short of unity. Zeaman and House, like Sutherland, assume that
the rate of learning of the observing response is generally slower than
that of the instrumental response, so once a learner has just met a
modest criterion of mastery of an initial discrimination, he would
generally be in a state such that probability of the correct
instrumental response to the positive cue would be near unity but the
probability of the correct observing response would be somewhat less
than unity. In that event,

* This property of the theory, however, is peculiar to the particular
acquisition [...] that have been chosen. If, for example, the
assumptions were modified slightly so as to incorporate the stimulus
fluctuation process of stimulus sampling theory (Estes, 1955), then
continued training would increase the likelihood both that the [...]
response would be retained over a long period of time and that it would
be transferred to new situations which share something in common with
the old one.

4. Discrimination Shift Studies


A large portion of the literature on transfer of discriminations has 
been concerned with comparisons between reversals and other types 
of shifts. This emphasis seems to be based on other than logical con- 
siderations, for the reversal situation is particularly difficult from a 
theoretical viewpoint and not amenable to a satisfactory analysis in 
terms of any extant theory. In the case of intradimensional shift, pre- 
sumably the only possibility of transfer is a carry-over of the observing 
or attentional response to the new situation. There can be no positive 
or negative transfer of the instrumental response, since the cues to 
which it has become associated in the first problem are not present in 
the second. On an extradimensional shift, possibilities are either nega- 
tive transfer or no transfer depending upon particular conditions. In 
the common case when the relevant and irrelevant dimensions are simply
interchanged, negative transfer would be expected, since the observing
response acquired during original learning would be evoked by irrelevant
stimuli on the transfer task. The cues to which the instrumental
response had been learned would also be present in the
transfer problem, but now uncorrelated with reward so they would ini- 
tially yield correct responding only at a chance level. 

If all of the cues of the original discrimination are replaced with new 
ones in an extradimensional shift, then there is no basis in the atten- 
tional theory to predict any difference in speed of discrimination 
learning as compared to learning of the original problem. 

In the case of reversal, both positive and negative transfer should be 
expected and thus the net outcome must depend on the balance of the 
two. Since the assignments of reward and nonreward to the formerly 
positive and negative cues of the relevant dimension are simply inter- 
changed on reversal, the correct instrumental response will normally 
have zero or near zero probability at the beginning of reversal learning, 
and this factor by itself would lead to an increase in error rate as
compared to early trials on the original problem. Since this was the only
form of transfer envisaged in classical association interference theory,
the earliest studies showing faster learning under reversal than under
original learning conditions were quite surprising.

In terms of the attentional theory, we should of course expect the 
correct observing response to be carried over from the original to the 
reversal problem since the relevant dimension is the same, and this 
factor by itself would produce positive transfer. In general there is no
basis for predicting which of the two opposed sources of transfer
should dominate in any situation, and by suitable choices of parameter
values in the attentional theory either positive, negative, or no transfer 
can be predicted. 
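
The dependence of the net outcome on parameter values can be sketched numerically. The code below is an illustrative assumption, not the attentional theory as published: it uses a deterministic mean-value approximation with two dimensions, a hypothetical attention rate `theta_obs`, a hypothetical instrumental rate `theta_inst`, chance responding when the irrelevant dimension is attended, and a criterion of 0.9. The starting states assigned to each kind of shift follow the verbal analysis given in the text.

```python
def trials_to_criterion(p_obs, p_inst, theta_obs, theta_inst,
                        criterion=0.9, max_trials=10_000):
    """Return (trials, p_obs, p_inst) when P(correct) first reaches criterion."""
    trials = 0
    # P(correct) = p_obs * p_inst + (1 - p_obs) * 0.5
    while 0.5 + p_obs * (p_inst - 0.5) < criterion and trials < max_trials:
        p_inst += theta_inst * p_obs * (1.0 - p_inst)  # learned only if attending
        p_obs += theta_obs * (1.0 - p_obs)             # drift toward relevant dim
        trials += 1
    return trials, p_obs, p_inst


def shift_start(kind, p_obs, p_inst):
    """Starting state on the transfer problem for each kind of shift."""
    if kind == "intradimensional":   # same dimension, new cues
        return p_obs, 0.5
    if kind == "extradimensional":   # relevant and irrelevant interchanged
        return 1.0 - p_obs, 0.5
    if kind == "reversal":           # same cues, reward assignments swapped
        return p_obs, 1.0 - p_inst
    raise ValueError(kind)


def compare(theta_obs, theta_inst):
    """Trials to criterion on the original problem and on each shift."""
    original, p_obs, p_inst = trials_to_criterion(0.5, 0.5, theta_obs, theta_inst)
    result = {"original": original}
    for kind in ("intradimensional", "extradimensional", "reversal"):
        start = shift_start(kind, p_obs, p_inst)
        result[kind] = trials_to_criterion(*start, theta_obs, theta_inst)[0]
    return result


slow_attention = compare(theta_obs=0.05, theta_inst=0.5)
fast_attention = compare(theta_obs=0.5, theta_inst=0.05)
print("slow attention, fast instrumental:", slow_attention)
print("fast attention, slow instrumental:", fast_attention)
```

With attention slow and instrumental learning fast, the carried-over observing response dominates and reversal is learned in fewer trials than the original problem; with the rates interchanged, the reversed instrumental response dominates and reversal takes longer. Both outcomes come from the same sketch, which is the sense in which the direction of transfer is a matter of parameter values.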

In the study of House and Zeaman (1962) discussed above, speeds of
learning under reversal and intradimensional shift proved approximately
equal, and both were superior to that under extradimensional shift. A
similar result was obtained by Eimas (1967) with normal children, the
same relationships holding for both overtrained and nonovertrained
groups. More often, however, overtraining is found to facilitate
reversal more than other types of shift. For example, Furth and Youniss
(1964) found for normal children [...] group, intradimensional shift
was superior [...]dimensional shift, the latter two being approximately
[...] an overtrained group intradimensional shift [...] superior to
extradimensional shift. A study by Campione et al. (1965) fits the same
general picture.

[...]sive reversals are in some [...]ble to reach the stage at [...] in
a single trial. However, the overlearning reversal [...] found
consiste[...], the phenomenon appearing in, for example, studies
conducted by Mackintosh (1965) with rats, but uniformly failing to
appear in similar experiments conducted by Warren and his associates
(Beck, Warren, & Sterner, 1966; Warren & Warren, 1967) with cats, and so
on.

A major possibility to be considered is that special susceptibility of
reversal learning to the effects of overtraining has to do with factors
that are in a sense extraneous, that is, that one should not expect to
be incorporated into a discrimination theory. One possible factor,
suggested by D’Amato and Jagoda (1961), is that during prolonged
overtraining the subject has no experience with the negative stimulus,
especially in the positional discrimination, so that some forgetting of
the consequences of the incorrect instrumental response may occur. The
result would be that after much overtraining the individual’s tendency
to avoid the negative cue would be reduced and thus there would be less
negative transfer to the acquisition of the instrumental response of
approaching the formerly negative cue in the reversal problem.

Similarly, the possibility comes to mind that during the long series of
correct responses during overtraining there may be some habituatory
decrement which would tend to increase the tendency to spontaneous
reversal. This notion would fit in particularly well with the pattern of
results in the rat, since it is well known that that animal tends
spontaneously to reverse a positional response following a consistent
sequence of rewards whereas the same “alternation tendency” is less
prominent in the case of response to cues which vary in position.

The finding that improvement in reversal learning with overtraining
occurs more regularly with human subjects than with animals, and more
with older than with younger human subjects, has been taken by the
Kendlers (1962) as support for a mediational theory, since the most
clearly documented instances of mediating responses are verbal, and 
verbalization in children increases on the average as a function of age. 
The possibility needs to be ruled out empirically, however, that older 
children are also more susceptible to habituation (or, to put it another 
way, boredom) than younger children and for that reason are more 
ready to reverse instrumental responses. 

A number of other comparisons of reversal with other types of shifts
in different types of learners can be taken to support the Kendlers’
interpretation in terms of developmental stages of mediating responses,
but at the same time do not exclude the other type of interpretation in
terms of a habituatory process. For example, Kendler and Kendler
(1959) found that within the same group of children faster learners
showed an advantage for reversal over nonreversal shift, whereas the
converse was true for the slower learners. (Here nonreversal shift
involves making the previously relevant dimension constant in the new
problem and the previously varying, irrelevant dimension relevant.)
Several other studies have shown the advantage of reversal over
nonreversal shift to increase with overtraining within groups of
children of the same age. Sanders, Ross, and Heal (1965) found no
differences between the reversal and extradimensional shift for
retarded children but a difference in favor of reversals for normal
children, and other studies have shown that for normal children the
advantage of reversal over extradimensional shift increases with age.

The difficulty in evaluating alternative interpretations of the [...]

[...] attend to the relevant cue of a relatively difficult visual
discrimination all mastered the discrimination, whereas nearly all of
the group given conventional trial-and-error training on the same
problem made no measurable progress whatever.

At the theoretical level, it seems to me quite beside the point to
devote much energy at the present stage to such matters as trying to
decide between different extant theories such as the attentional theory
of Zeaman and House and the mediational formulation of the Kendlers.
Both of these formulations are proving extremely useful aids to
research, but as I have indicated above, it is probably best at this
point to think not of alternative theories, but rather of a number of
different mechanisms (associative, attentional, mediational), all of
which may have places in a viable theory of discrimination learning.

The attentional formulation of Zeaman and House is a particularly apt
starting point for further theoretical efforts in this area at the
moment because they have gone further than others in spelling out a
specific set of assumptions which can be brought to quantitative test.
In the immediate future, considerable purely theoretical work needs to
be done by way of changing particular assumptions or subsets of
assumptions in their theory and working out the consequences of these
by computer-simulation methods much as Zeaman and House have done for
their own initial formulation. By this means it may be possible to gain
insight as to which features of their theory are relatively robust and
which features can readily be altered in order to circumvent some of
the difficulties and apparent paradoxes that have arisen in
interactions with research to date.

Two basic issues with regard to Zeaman and House’s formulation are in
particularly urgent need of clarification. The first is the question
whether the first stage of the two-stage process is correctly identified
as one involving observing responses, or attention as the concept is
ordinarily conceived, as distinguished from a process of coding or
selective memory storage. The second issue, presuming that further
analysis continues to implicate an attentional mechanism of the general
type conceived by Zeaman and House, is whether the present assumption
of a closed set of attentional responses is correct.

Some of the difficulties and apparent paradoxes that have arisen in
applications of Zeaman and House’s model, and which have been met
with various ad hoc suggestions, are intimately tied up with the
assumption that the individual is always attending to some dimension in
any situation, the consequences of learning operations being simply to
shift attention from one dimension to another. An alternative is to
assume that at any time an individual may be attending to some or to
none of the stimulus dimensions represented in a situation. On the
latter type of assumption, learners of higher developmental levels
could be assumed, as seems intuitively natural, to have higher
probabilities of attending to all of the dimensions of an experimental
situation than those of lower level, and this difference could
facilitate rather than hinder discrimination learning on the part of
the “brighter” individuals.
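
The contrast between the two assumptions can be made concrete with a small sketch. It is an illustration only, with hypothetical dimension names and numerical values: under the closed-set assumption the attention probabilities are normalized to sum to one, whereas under the alternative each dimension is assumed to be attended independently, so that none may be attended on a given trial.

```python
def closed_set(strengths):
    """Closed set: attention probabilities are forced to sum to 1."""
    total = sum(strengths.values())
    return {dim: s / total for dim, s in strengths.items()}


def open_set_none(probs):
    """Open set: probability that no dimension is attended on a trial,
    assuming each dimension is attended independently."""
    p_none = 1.0
    for p in probs.values():
        p_none *= 1.0 - p
    return p_none


dims = ("color", "form", "position")  # hypothetical dimensions

# Closed set: doubling the strength of every dimension changes nothing,
# so a uniformly "more attentive" learner cannot be represented.
low_closed = closed_set({d: 1.0 for d in dims})
high_closed = closed_set({d: 2.0 for d in dims})

# Open set: every dimension can have a higher probability at once.
low_open = {d: 0.3 for d in dims}
high_open = {d: 0.6 for d in dims}

print(low_closed["color"], high_closed["color"])
print(low_open["color"], high_open["color"])
print(open_set_none(low_open), open_set_none(high_open))
```

In the closed-set case any gain in attention to one dimension must be paid for by a loss on another, which is the source of the difficulty noted above; in the open-set case the learner at the higher level simply attends to more of the situation.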


XIV. General Comments on Methodology 


The general impression created by this review of empirical and theo- 
retical literature is that a discipline having to do with the development 
of learning ability has only begun to take form. Some of the essential 
ingredients are beginning to appear, but major efforts up to the present 
have been concentrated upon a few facets of a problem which needs 
broad-scale and sustained attack. 

It is tempting to seek grounds for optimism in the impressive quanti- 
tative increase in research on learning abilities and disabilities during 
the past few years. Faith in the idea that the urgent practical problems 
of this area will be solved by sheer industry must be tempered, how- 
ever, by observation of the rapidity with which diminishing returns set 
in as the volume of work using the same methods and concepts is ex- 
panded. The analyses set forth in this volume almost without exception 
point to the conclusion that major advances at both the conceptual and 
practical levels come from close interactions of theory and experiment. 

Perhaps one of the most pervasive conclusions arising from an ex- 
amination of research within the framework of contemporary learning 
theory is the impossibility of sharply dissociating learning abilities 
from the results of learning. In every type of learning situation which 
has been subjected to considerable experimental study one finds that, 
when an individual of a given age is exposed to a learning task, both 
what he learns and how he learns it depend in major ways upon the 
results of past learning in other situations. Though relevant empirical 
evidence is meager, there are some theoretical reasons for assuming 
that individual differences in measured rates of learning must depend 
in part upon individual differences in elementary capacities which are 
to a major extent genetically determined. There are equally cogent 
theoretical grounds, with much more substantial empirical support, for 
believing that at least in the case of human learners, individual differ- 
ences in ability at any age also depend strongly upon cumulative ef- 
fects of past learning as embodied in learning sets and strategies. 

This conclusion leads one to envisage two main tasks for those
currently interested in advancing a science of learning ability.

1. The first of these, logically at least, involves the identification of
elementary capacities and the tracing of the course of their maturation
in the developing organism. Quite possibly, major theoretical progress
on this line will have to wait upon the advance of behavioral and
neurophysiological analyses of simple forms of learning and related
behavior-genetic analysis. At a more molar level, the comparative
approach to the study of mental retardation espoused especially by Ellis
(1969) is presumably directed at least in part toward locating basic
capacities by comparing the learning of normal and retarded individuals
in standard laboratory situations.

Although there seems to be no satisfactory general rationale for the
comparative approach, it is possible [...] success for a few particular
lines of research pursued within this framework which are influenced
also by developments in learning theory. Perhaps the most important case
in point is the short-term memory process revealed by converging lines
of investigation, some arising within mathematical learning theory
(e.g., Atkinson & Shiffrin, 1968) and some within the comparative
approach to mental retardation (Ellis, 1969). It must be emphasized that
in both cases this process has been isolated only as a result of
intensive experimental and quantitative analysis. For those with major
interest in understanding individual differences, it may be a bit of a
disappointment to find that short-term [...]

[...] this instance the necessary auxiliary tools will presumably have to be
drawn from concurrently developing theories of learning and cognitive 
processes. 

Some of the considerations involved are well illustrated in a recent
work of Kendler, Kendler, and Marken (1969) on developmental trends
in the use of representational responses by children in dealing with
simple discrimination and concept learning problems. The procedure
which appeared to be the principal reliance of many earlier
investigators in this area, namely the amassing of data for performance
on the same standardized task by children at different age levels, has
proven all but barren for theoretical purposes. One can find much more
promise for the future in the studies, so far few in number, which
compare, at different age levels, either relationships between
performances on different tasks or processes inferred from performance
within a theoretical framework. As illustrated in the Kendlers’ work,
this latter type of investigation will often require the devising of
novel experimental procedures dictated by theoretical considerations
(e.g., the half-reversal shift in the 1969 study). Frequently there will
also be required some modification of the basic learning task as it is
applied at different age levels in order to isolate changes in
performance attributable to the process under investigation from those
attributable to other processes which are correlated in the course of
normal development.

To what extent can one assume that broadly conceived programs of
theoretical and experimental attack upon the general problems of
learning ability and mental development will solve the practical
problems of dealing with the mentally retarded and other individuals
with more special learning disabilities? In attempting to come to grips
with this enormously complex question, I have found it useful to think
in terms of the schema proposed by Zigler (1967, 1969) who distinguishes
between “pathological” and “normal” cases of low intelligence or
learning disability. If it be granted that performance on any learning
task is the result of some complex assemblage of genetic and
experiential determiners, then one must expect to find in any population
of individuals a more-or-less normal distribution of scores no matter
how learning ability is measured. The individuals who fall toward the
bottom of the distribution may pose special problems for society, but
from a theoretical standpoint they differ only quantitatively from the
individuals we term “normal” or “superior.”
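
The statistical point behind this argument can be illustrated with a short simulation. It is a sketch under strong simplifying assumptions that are not in the text: each individual's score is taken to be the plain sum of many independent, equally weighted determiners, each drawn uniformly between 0 and 1. The function names, the number of determiners, and the seed are hypothetical.

```python
import random


def simulate_scores(n_individuals=5000, n_determiners=50, seed=0):
    """Each score is the sum of many small independent determiners."""
    rng = random.Random(seed)
    return [sum(rng.random() for _ in range(n_determiners))
            for _ in range(n_individuals)]


def fraction_within_one_sd(scores):
    """Share of scores lying within one standard deviation of the mean."""
    n = len(scores)
    mean = sum(scores) / n
    sd = (sum((s - mean) ** 2 for s in scores) / n) ** 0.5
    return sum(abs(s - mean) <= sd for s in scores) / n


scores = simulate_scores()
# For a normal distribution this share is about 0.68.
print(round(fraction_within_one_sd(scores), 3))
```

No single determiner is normally distributed here, yet the summed scores approximate the normal form, so that individuals at the bottom of the simulated distribution differ from the rest only in degree.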

The understanding of the individual differences in manifest learning
ability that arise throughout the range of this normal distribution must
be part and parcel of the general problem of understanding the
organization of learning processes in the individual and the way in
which this organization develops as a function of specific kinds of
experience. Further, I think that any substantial advances in improving
the learning performance of individuals who fall at the lower end of
this distribution must be expected to come from general theoretical
advances, rather than from comparisons of individuals falling at the
middle and lower portions of the distribution on various particular
tasks. Such comparisons have their uses, as we have observed, but I
have been able to find no rational basis for the heavy emphasis on
comparisons between “normals” and “retarded” rather than between
“normals” and “accelerated” individuals. In fact there are reasons to
think that the latter type of comparison might often be the more
fruitful, since it is generally more advantageous to study a trait or
process by finding cases in which it is exemplified to an unusual
degree than by concentrating on cases in which it is poorly represented
or absent.

The residual class of learning disabilities in Zigler’s schema is that
including individuals who exhibit identifiable organic defects or who
have suffered from specific abnormalities of experience, for example
extreme social deprivation in early childhood. These pathological cases
must necessarily be so heterogeneous in character that one can scarcely
expect to accomplish much in attempting to deal with them on general
theoretical principles. At a research level, comparative studies must be
important, but it would seem that the comparisons should be
concentrated, not on differences between these individuals and normals,
but rather on differences between the performance of these individuals
and that required for social adjustment. The focus of attention needs to
be, not on the locus of the disability of a particular individual, but
rather upon his remaining resources which might be capitalized upon to
compensate for the particular pathology.

Of the various schools of investigators engaged in research upon
learning in the mentally retarded, those associated with the operant
conditioning tradition seem to take most seriously the strategy of
analyzing the performance of the retarded individual, comparing it in
detail with that desired, and generating specific research programs
calculated to yield information as to how to attain the desired
resultant. At the present time, the specific methods of [...] applicable
largely to problems [...] whom it is a major accomplishment [...]

Though some of the accomplishments obtained by application of
operant conditioning methods to the severely retarded seem quite use- 
ful, indeed impressive, perhaps even greater fruits could be hoped for 
by application of the same general strategy at the level of the educable 
retarded. The vast preponderance of learning research at this level 
seems to have been concerned primarily with comparisons between 
retarded individuals and normals in various simplified laboratory 
learning tasks. It is very difficult to find cases in which investigators 
have analyzed the deficient performance of the higher-grade defective in 
school learning or other situations involving cognitive or verbal func- 
tioning, proposed a specific hypothesis as to the deficiencies in the 
previous learning history, and developed individual training programs 
which, on the basis of the best available learning theory, might be cal- 
culated to lead to the desired performance. 

With even the meager beginnings of a theory of cognitive develop- 
ment now available, it seems clear that one could prescribe an environ- 
ment and a sequence of learning experiences for a normal child which 
would lead him in adulthood to exhibit discriminative and problem- 
solving behaviors of the kind which characterize the mentally retarded. 
Less is known as to the type of environment which might be expected 
to lead any except perhaps the severely brain-damaged child to de- 
velop the learning strategies which now characterize normal adults, 
and still less as to the conditions, if any, which might produce this
result in the case of the mentally retarded adult. However, so much is at
stake from both theoretical and practical viewpoints, that major
research efforts directed toward these possibilities appear to be called
for.


It must be emphasized in this connection that the amounts of
experience involved in the development of learning strategies in the
normal child must be very large compared to those provided in ordinary
laboratory experiments. Despite the formidable practical difficulties
involved, high priority must attach to the need for conducting
sufficiently long-term experimental studies to provide substantial
evidence concerning the reversibility of the deficits in discriminative
learning and cognitive abilities which characterize the mentally
retarded.

With respect to the principal motif of the present review, one must
conclude that, on the whole, contacts between learning theory and the
empirical study of mental development have been sparse and
unsystematic. Efforts toward more effective interplay have so far
appeared almost exclusively in connection with mental deficiency and
learning disability. Here the results have been quite encouraging, with
particularly fruitful indications of progress in both theory and
application arising from sustained and intensive
theoretical-experimental analyses of short-term memory, discrimination
learning, and organization and retrieval strategies in verbal recall.

Reasons for the relative isolation of the two disciplines in the [...]
the study of mental development has [...] from primarily observational
to [...] methods. On the other hand, learning theory, in this country at
least, [...] from the outset with laboratory studies of [...] which
surely have nothing to do with [...], the subjects of laboratory
analyses of learning were for a long period confined largely to human
adults and a few species of lower animals. Naturally enough, the
concepts and [...] learning theories reflect these limitations [...] for
expressing ideas regarding developmental processes.

[...] that diagnosing a weakness in scientific strategies rarely leads
in [...] direct way to reform, but even so there are some grounds for
guarded optimism in the present instance. At [...] already be taking
form in the increasing [...] phenomena of learning and memory, with
[...] to deal with the organization of learning